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Introduction 

The  erbB/HER  receptor  family  mediates  growth  factor  induced  signal 
transduction  across  membranes.  A  complex  network  of  both  homo-  and  hetero- 
oligomeric  interactions  exists  for  the  four  members  of  this  receptor  family  [1]. 
Overexpression  of  the  second  receptor,  erbB2,  is  observed  in  20-30%  of  human  breast 
cancers  and  correlates  with  poor  clinical  prognosis  [2]  [3].  The  goals  of  our  research  are 
to  examine  the  role  played  by  the  transmembrane  domains  of  the  erbB  receptors  in  the 
network  of  receptor-receptor  interactions. 

Body 

We  have  completed  Task  One  and  most  of  Task  2.  Based  on  the  results  in  these 
two  Tasks,  we  have  replaced  Task  3  with  the  development  of  a  theoretical  treatment  of 
protein  distributions  in  micelles.  We  apply  this  theory  to  explain  our  results,  and  to  re¬ 
evaluate  the  conclusions  from  other  studies  in  the  literature. 

The  key  research  accomplishments  with  specific  references  to  the  approved 
“Statement  of  Work"  are  given  below. 

Key  Research  Accomplishments 

Overall  Accomplishments. 

This  was  the  third  year  of  our  grant  award.  In  previous  years,  we  have  cloned, 
expressed, 'purified  and  measured  the  transmembrane  association  energetics  for  the 
human  erbBI,  erbB2,  erbB3  and  erbB4  transmembrane  domains.  We  found  that  all  of 
these  transmembrane  domains  self-associate  in  a  manner  that  is  much  weaker  than  the 
self-association  of  the  glycophorin  A  transmembrane  domain.  The  extent  of  association 
can  be  expressed  as  an  average  molecular  weight  for  each  species,  and  we  found  that 
the  molecular  weights  correlate  well  with  tendencies  for  erbB  transmembrane  domains 
to  self-associate  in  bacterial  membranes.  Thus,  we  can  conclude  that  the  propensities 
for  self-association  we  observe  in  micelles  are  mirrored  in  biological  membranes.  We 
also  carried  out  mixing  experiments  and  found  that  the  transmembrane  domains  do  not 
mediate  strong  hetero-interactions. 

Even  though  some  dimerization  was  observed  for  the  erbB  transmembrane 
domains,  the  modest  association  we  observed  was  of  concern  to  us  due  to  the 
possibility  for  “artifactua!  togetherness"  in  detergent  micelles.  To  address  this  question, 
we  developed  a  theoretical  formalism  for  describing  the  expected  random  distribution 
for  non-associating  proteins  in  micelles.  In  light  of  this  theory,  we  find  that  erbBI,  erbB2 
and  erbB3  transmembrane  domains  dimerize  at  a  level  consistent  with  the  random 
expectation,  while  erbB4  dimerizes  only  slightly  more.  Nevertheless,  the  inclusion  of 
transmembrane  domains  in  micelles  can  enhance  the  apparent  association  of  a 
covalently  linked  extramembranous  domain. 

We  apply  our  thermodynamic  formalism  to  re-interpret  some  data  published  by 
Tanner  &  Kyte  (1999),  and  we  find  that  our  results  are  consistent  with  their  findings 
when  their  data  are  considered  using  our  approach. 

The  following  manuscripts  were  published  based  on  our  research  findings: 
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FJ  Kobus  &  KG  Fleming  (2005)  “The  GxxxG-containing  transmembrane  domain  of 
the  CCK-4  oncogene  does  not  encode  preferential  self-interactions"  Biochemistry  44: 
1464-1470. 

AM  Stanley  &  KG  Fleming  (2005)  “The  transmembrane  domains  of  erbB  receptors 
do  not  dimerize  strongly  in  micelles”  J.  Mol.  Biol.  347:  759-72. 

KG  Fleming  (2005)  “Analysis  of  membrane  proteins  using  analytical 
ultracentrifugation"  (Invited  Book  Chapter)  In  Press.  Full  bibliographical  information  not 
yet  available. 

Specific  Research  Accomplishments. 

1.  Assay  for  heterodimerization  of  erbB  transmembrane  domains.  (Task  Two, 
Parts  A  &  B) 

1. 1.  SDS-PAGE  Analysis  of  Potential  Heterodimers. 

The  goal  of  this  task  was  to  determine  whether  the  erbB  transmembrane 
domains  would  mediate  heterodimer  formation.  Our  initial  screen  was  carried  out  using 
SDS-PAGE  analysis,  and  we  considered  all  six  possible  heterodimer  configurations. 
SDS-PAGE  analysis  can  detect  the  presence  of  strong  transmembrane  helix-helix 
interactions,  in  which  case  the  proteins  would  migrate  in  oligomeric  forms.  The  results 
are  shown  in  Figure  1 . 


1+2  1+3  1+4  2+3  2+4  3+4 


Dimer 


Monomer 


Figure  1 :  SDS-PAGE  analysis  of  SNerbB  mixing  experiments  using  a  20%  acrylamide  pHast  gel.  All 

samples  migrated  to  a  position  consistent  with  monomers  in  solution.  Lane  1 ,  Markers;  Lane  2,  SNerbB1/B2; 
Lane  3,  SNerbB1/B3;  Lane  4,  SNerbB1/B4;  Lane  5,  SNerbB2/B3;  Lane  6,  SNerbB2/B4;  Lane  7,  SNerbB3/B4. 

As  can  be  seen  in  Figure  1,  we  found  that  no  pair  of  erbB  transmembrane  domains 
dimerized  strongly. 

1.2.  Sedimentation  Equilibrium  Analysis  of  Potential  Heterodimers 

We  also  used  sedimentation  equilibrium  to  attempt  to  detect  the  presence  of 
modest  interactions  mediated  between  SNerbB2  and  SNerbB3.  We  chose  this  pair 
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because  the  erbB2/erbB3heterodimer  is  the  most  potent  signaling  complex  [4,  5],  and 
only  weak  heterointeractions  can  be  detected  between  the  soluble  ligand-binding 
domains  [6],  so  we  hypothesized  that  if  transmembrane  helix-helix  interactions  were 
important  for  stabilizing  any  of  the  hetero-complexes  it  would  be  the  erbB2/  erbB3 
complex.  Our  results  are  shown  in  Figure  2.  They  demonstrate  that  an  equimolar 
amount  of  SNerbB2  and  SNerbB3  at  23  mMCaEs yielded  a  molecular  mass  consistent 
with  protein  monomers  in  solution. 


Figure  2:  A  representative  sedimentation  equilibrium  data  set  for  a  mixture  of  equimolar 
amounts  of  SNerbB2  and  SNerbB3  fit  to  a  single  ideal  species.  The  raw  data  are 
represented  as  open  circles  and  the  continuous  line  represents  the  fit  and  the  upper  panel 
shows  the  residuals  for  the  fit.  The  data  are  well  described  by  a  single  ideal  species  with  a 
molecular  mass  of  21,273  kDa,  which  is  equal  to  the  average  of  the  two  monomers,  SNerbB2 
(20,876  kDa)  and  SNerbB3  (21,669  kDa). _ 


2.  A  Theoretical  Formalism  to  Explain  the  Modest  Free  Energies  of  Self- 

Association  that  we  observed.  (Replaces  Task  Three  due  to  the  results  from 
the  experiments  in  Tasks  1  and  2.) 

2. 1 .  Theoretical  formalism. 

The  goal  of  this  research  was  to  determine  whether  the  modest  free  energies  of  self¬ 
association  we  observed  for  the  erbB  transmembrane  domains  represented  a 
thermodynamically  meaningful  preferential  association  of  the  population. 

The  crux  of  the  issue  lies  in  the  fact  that  the  sedimentation  experiments  were  carried 
out  at  proteimdetergent  ratios  much  higher  than  were  previously  used  for  glycophorin  A 
[7]  in  order  to  observe  SN-erbB  interactions.  At  the  highest  SN-erbB  protein 
concentration  the  number  of  protein  molecules  approached  the  number  of  micelles. 
Since  the  fundamental  measurement  in  the  sedimentation  equilibrium  under  the  current 
experimental  conditions  is  the  protein  mass,  two  protein  molecules  that  occupy  the 
same  micelle  will  sediment  together  and  be  measured  as  a  dimer  whether  or  not  there 
is  a  preferential  interaction  between  them.  This  co-partitioning  phenomenon  has  been 
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referred  to  in  the  literature  as  “artifactual  togetherness”  [8]  and  should  only  occur  at  high 
protein  concentrations  coupled  with  low  detergent  micelle  concentrations.  We 
hypothesize  that  this  co-partitioning  is  due  to  a  type  of  excluded  volume  effect  and  that 
these  conditions  can  be  thought  of  as  thermodynamically  nonideal.  A  similar  excluded 
volume  effect  has  been  indicated  to  occur  in  soluble  proteins  at  high  protein 
concentrations  or  in  the  presence  of  small  molecules  that  induce  crowding  conditions 
[9].  To  determine  the  extent  of  thermodynamically  meaningful  preferential  interactions 
between  the  SN-CCK4  transmembrane  proteins  it  is  first  necessary  to  distinguish  them 
from  the  statistical  protein  interactions  induced  by  the  limiting  solvent  conditions. 

To  develop  a  model  describing  the  distribution  of  non-interacting  proteins  in  micelles, 
we  employed  a  statistical  approach  that  considers  distribution  of  proteins  in  micelles  in 
terms  of  the  random  occupancy  of  placing  balls  in  bins,  where  the  protein  molecules 
represent  balls  and  the  individual  micelles  represent  bins.  Using  this  formalism,  the 
random  occupancy  of  any  particular  micelle  by  any  particular  protein  molecule  can  be 
described  as  a  Bernoulli  event  (e.g.  either  the  micelle  contains  that  protein  or  not), 
which  can  be  described  by  a  binomial  distribution: 


m 


Pr[X,.  =  *H  \p\l-p ) 


i -k 


(1) 


where  is  the  random  variable  that  counts  the  number  of  proteins  in  a  particular 
micelle,  i,  k  is  the  number  of  proteins  occupying  a  micelle  ( e.g .  the  “oligomeric  state”), 
m  is  the  number  of  protein  molecules,  p  is  the  probability  of  each  Bernouilli  trial,  which 
equals  1  In  and  n  is  the  number  of  micelles.  When  the  number  of  proteins  and  the 
number  of  micelles  both  exceed  the  allowed  occupancy  of  a  particular  micelle,  k,  this 
binomial  distribution  is  approximated  by  the  Poisson  distribution: 


lim(m,«»^):Pr[X,  =A:]»i|— j  e'mln  (2) 

To  consider  the  statistical  protein  distribution,  we  write  the  partition  function,  P, 
as  the  sum  all  those  probabilities  for  which  £>0.  (e.g.  For  this  purpose,  we  are  not 
interested  in  the  distribution  of  "empty"  micelles).  To  convert  each  probability  to  a 
relative  concentration,  each  term  is  multiplied  by  the  number  of  proteins  that  the 
probability  represents. 


P-2*Pr[X,.*]  (3) 

A>0 

The  fraction  of  any  particular  species  can  then  be  calculated  as  a  function  of  the 
mole  fraction  protein  {min)  in  the  micellar  phase  from  this  partition  function  by  dividing 
its  contribution  by  the  sum  of  all  species.  For  example  the  fraction  dimer  can  be 
calculated  as: 


ft 


Dimer 


2Pr[X,.  =  2] 

2*Prfr-*] 


*>0 


(4) 
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2.2.  Application  of  this  formalism  to  the  SN-erbB  Distributions 

The  exact  form  of  the  partition  function  and  the  distribution  of  species  will  depend 
on  the  allowed  values  of  k  in  the  model.  In  this  study  in  the  SN-erbB  proteins  we  used 
the  species  as  determined  by  SDS-PAGE  crosslinking  as  a  guide,  and  limited  the 
species  to  monomer,  dimer  and  trimer.  In  addition,  we  postulate  that  an  unlimited 
number  of  proteins  in  a  micelle  would  not  be  physically  possible.  Experimentally,  this 
would  behave  as  an  aggregated  protein,  and  we  approximated  this  as  a  micelle 
containing  ten  monomers.  While  we  did  not  observe  any  aggregation  in  our 
sedimentation  equilibrium  experiments,  we  wished  to  calculate  the  statistical  distribution 
over  wide  range  of  mole  fraction  values.  Including  this  parameter  in  the  Poisson  model 
allows  us  to  approximate  when  this  might  occur,  and  by  including  this  term  in  the 
partition  function,  we  confirmed  that  decamer  is  not  a  significant  species  within  the 
concentration  range  shown  in  Figure  3  (data  not  shown).  Thus,  for  clarity,  we 


Figure  3.  Species  Distribution  of  SN-erbB4.  it  can  be  seen  that  the  experimentally  observed  monomeric 
species  of  SN-erbB4  decays  only  slightly  more  rapidly  that  the  random  expectation,  and  that  there  is  a 
significant  non-negligible  concentration  of  stochastic  dimer  and  trimer  under  the  experimental 
concentrations  used.  The  species  distributions  for  the  SN-erbBI,  SN-erbB2  and  SN-erbB3  proteins  are 
essentially  the  same  as  the  random  distributions  shown  here.  Thus,  we  must  conclude  that  the  free  energy 
of  preferential  interactions  for  these  three  transmembrane  domains  (as  isolated  domains)  are  extremely 
modest  and  on  the  order  of  1  kcal  mol-1  or  less. 

The  blue,  red  and  green  lines  represent  monomeric,  dimeric  and  trimeric  species,  respectively. 
The  lines  with  a  thickened  portion  represent  the  experimentally  observed  species  distributions.  The  lines 
that  are  completely  thin  represent  the  stochastic  distributions  calculated  from  our  theoretical  model. 
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subsequently  removed  this  term,  and  the  final  partition  function  contains  only 
monomeric,  dimeric  and  trimeric  species. 

Since  we  observed  the  most  self-association  for  the  SN-erbB4  construct,  we 
found  it  most  useful  to  compare  the  distribution  of  the  species  for  this  transmembrane 
domain  with  the  expected  random  distribution.  This  is  shown  in  Figure  3. 

2.3.  Sensitivity  analysis  of  the  micelle  aggregation  number  in  calculating  the  random 

distribution  of  oligomers. 

A  parameter  in  this  statistical  model  required  to  convert  micelles  to  total  micellar 
detergent  concentration  is  the  micelle  aggregation  number,  where  the  latter  is  the 
experimental  variable.  Because  of  questions  regarding  the  impact  of  this  number  on  the 
random  distribution,  we  used  a  partition  function  containing  only  monomeric  and  dimeric 
species  to  determine  the  dependence  of  the  random  distribution  on  the  aggregation 
number.  To  compare  the  results  of  our  simulation  with  a  simple  monomer-dimer 
experimental  system,  we  explored  this  issue  in  depth  on  a  study  of  the  interactions 
mediated  by  the  transmembrane  domain  of  the  CCK-4  oncogene.  CCK-4  is  a  recently 
cloned  colon  carcinoma  kinase  and  it  is  thought  to  represent  a  new  member  of  the 
receptor  tyrosine  kinase  super  family  [10].  Like  the  transmembrane  domains  of  the  erbB 
receptors,  the  transmembrane  domain  of  CCK-4  contains  a  conserved  GxxxG 
sequence  thought  to  promote  interactions  between  transmembrane  helices  [11,  12]. 
The  evolutionary  conservation  of  the  CCK-4  transmembrane  sequence  suggested  the 
hypothesis  that  it  may  play  a  functionally  important  role  in  signaling  that  involves 
transmembrane  helix-helix  interactions.  Like  the  SN-erbB  receptors,  we  found  extremely 
modest  interactions  between  the  CCK-4  transmembrane  domains. 

To  calculate  the  dependence  of  the  SN-CCK-4  random  distribution  on  micelle 
aggregation  number,  we  assumed  a  range  of  aggregation  values.  For  protein-free 
micelles,  we  assumed  an  aggregation  number  for  Cl  4  betaine  micelles  of  100,  which 
lies  intermediate  between  the  published  values  for  Cl 2  betaine  and  Cl 6  betaine 
micelles  [13,  14],  For  protein-containing  micelles,  we  recognize  that  the  number  of 
detergents  bound  may  or  may  not  be  similar  to  the  protein-free  aggregation  number, 
and  it  may  also  be  that  the  monomeric  and  dimeric  forms  of  the  protein  would  be 
solvated  by  a  micelle  of  a  different  size.  Under  the  density-matching  conditions  used  for 
sedimentation  equilibrium,  there  is  no  experimental  information  about  the  size  of  the 
micelle  required  to  solvate  the  transmembrane  regions  of  the  protein  molecules.  For 
large  membrane  proteins  le  Maire  and  colleagues  have  postulated  that  the  amount  of 
detergent  bound  depends  on  the  hydrophobic  protein  surface  area  [15-17],  However,  for 
smaller  membrane  proteins,  such  as  the  single  transmembrane  helix  of  CCK-4,  it  is 
thought  that  the  transmembrane  region  may  serve  as  an  organizing  hydrophobic 
nucleus  surrounded  by  detergent  molecules  that  bind  in  a  micellar  fashion,  and  the 
amount  of  detergent  bound  to  the  glycophorin  A  transmembrane  helix  is  similar  to  the 
aggregation  number  for  protein-free  micelles  [18].  We  therefore  used  the  protein-free 
number  as  a  starting  approximation  of  the  number  of  detergent  molecules  bound,  and 
then  we  varied  this  number  from  50-150  to  account  for  our  uncertainty.  As  long  as  the 
number  of  detergents  bound  is  within  this  range,  our  results  shown  in  Figure  3  indicate 
that  the  exact  value  is  not  an  extremely  sensitive  factor  in  this  analysis,  and  the 
fundamental  conclusion  is  the  same. 
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Figure  4  shows  the  mole  fraction  distribution  of  species  for  SN-CCK-4  overlaid 
upon  the  mole  fraction  distribution  of  species  from  the  random  distribution  model 
containing  monomer  and  dimer  forms.  It  can  be  seen  that  the  decrease  in 
experimentally  observed  monomeric  SN-CCK4s  tracks  well  with  the  decrease  in  the 
statistical  distribution  of  monomer  using  an  aggregation  number  of  100  as  depicted  by 
the  blue  broken  line.  Concomitantly,  the  increase  in  experimentally  observed  dimeric 
SN-CCK4  is  well  approximated  by  the  expected  statistical  distribution  of  dimer.  Even  if 
the  number  of  detergents  bound  varies  by  ±50,  the  experimentally  observed  SN-CCK4s 
species  deviate  only  slightly  from  the  random  distributions  suggesting  that  the  number 
of  detergent  molecules  solvating  each  species  is  a  relatively  insensitive  parameter.  The 
results  are  very  similar  when  the  partition  function  for  the  random  model  includes  an 
additional  trimeric  form  (data  not  shown). 


Figure  4.  The  SN-CCK4s  species  distribution  is  similar  to  the  stochastic  distribution  in 
micelles. 

The  solid  blue  and  red  lines  are  the  experimentally  determined  monomer  and  dimer 
distributions,  respectively.  The  thickened  portions  of  these  curves  represent  those  concentrations  that 
were  experimentally  observed,  and  the  thin  portions  of  these  curves  represent  extrapolations  of  those 
distributions  to  the  equilibrium  midpoint.  The  broken  blue  and  red  lines  are  the  monomer  and  dimer 
distributions  predicted  from  the  Poisson  distribution  of  proteins  in  micelles  assuming  no  preferential 
interaction  with  an  aggregation  number  of  100.  The  shaded  regions  represent  the  area  covered  by  the 
distributions  allowing  the  aggregation  number  to  vary  from  50-150.  Mole  fraction  on  the  abscissa  is 
defined  as  the  ratio  of  moles  of  protein  to  moles  of  micellar  detergent. 
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We  can  consider  the  free  energy  of  preferential  interactions  for  SN-CCK4  as  a 
reflection  of  the  deviation  of  the  experimental  monomer-dimer  midpoint  from  that  of  the 
stochastic  distribution.  Within  the  range  of  distributions  shown  by  the  shaded  regions  of 
Figure  4,  the  comparison  demonstrates  that  the  free  energy  of  any  preferential  self¬ 
interactions  between  SN-CCK4  proteins  in  micelles  is  minimal  and  certainly  less  than 
one  kcal  mol'1.  By  comparison  with  the  calculated  random  distributions,  it  can  be  seen 
that  the  midpoints  overlay  almost  exactly  if  the  average  aggregation  number  equals  50. 
In  this  case,  we  can  conclude  that  the  dimerization  propensity  of  SN-CCK4s  arises 
solely  from  the  random  distribution  in  micelles;  e.g.  association  of  SN-CCK4  is 
completely  due  to  overloading  of  the  micelles  with  protein  and  not  from  any  preferential 
interactions  between  the  helices.  On  the  other  hand,  if  the  average  aggregation  number 
is  150,  the  midpoint  of  the  experimental  curve  (1.5x10"2)  falls  to  right  of  the  stochastic 
curve  (5x1  O'3).  This  result  might  suggest  the  presence  of  slight  repulsive  interactions 
between  the  SN-CCK4  proteins.  Physically,  this  is  possible  using  this  stochastic  model, 
because  there  is  a  significant  population  of  protein-free  micelles  at  the  midpoint  of  the 
random  distribution.  In  this  latter  case,  the  mole  fraction  concentration  difference 
between  the  midpoints  of  the  random  and  experimental  populations  is  approximately 
three-fold.  From  this,  we  can  estimate  the  free  energy  of  repulsion  as  being  on  the  order 
of  the  available  thermal  energy  at  25°C.  Thus,  even  if  with  the  uncertainty  of  knowing 
the  size  of  the  protein-micelle  complex,  we  can  conclude  that  the  transmembrane 
domains  of  CCK4,  like  those  of  the  SN-erbB  receptors  and  unlike  those  of  glycophorin 
A,  do  not  encode  a  thermodynamically  meaningful  preferential  self-interaction  in 
micelles. 

2.4.  A  transmembrane  domain  lacking  preferential  interactions  can  still  enhance  the 
interactions  of  a  covalently  linked  soluble  domain:  Reinterpretation  of  the  Tanner  &  Kyte 
data  and  Implications  for  the  affinity  ofEGF  for  the  EGF  receptor  ligand-binding  domain. 

In  studies  of  SNGpA  fusion  proteins,  it  has  been  shown  that  it  is  appropriate  to 
treat  the  micelles  as  a  distinct  thermodynamic  phase  and  to  consider  the 
transmembrane  segments  confined  to  this  phase  [19-21].  Such  a  consideration  leads  to 
the  expression  [19]: 

A  Gm  =  AG;  +  RT\ n[  MicellarDet]w  (5) 

where  AGapp  is  the  observed  apparent  free  energy  in  bulk  aqueous  concentrations,  A G°x 

is  the  standard  state  mole  fraction  free  energy  of  association,  R  is  the  universal  gas 
constant,  T  is  the  temperature  (K),  and  [Micellar  Det]w  is  the  aqueous  concentration  of 
the  detergent  in  micelles.  The  first  term  represents  the  association  free  energy  of  the 
intrinsic  chemical  components  of  the  association  reaction,  and  the  second  term 
represents  a  statistical  energy  of  mixing,  the  magnitude  of  which  varies  with  the  micellar 
detergent  concentration.  In  the  case  of  GpA,  we  believe  the  contribution  of  the  soluble 
SN  domain  to  the  standard  state  free  energy,  AG"  is  negligible,  and  we  interpret  AG"  as 
the  contribution  from  interactions  between  the  transmembrane  domains. 
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However,  the  soluble  domain  of  EGFR  (erbBI),  when  ligand-bound,  does 
interact.  Equation  [5]  predicts  that  extending  these  fragments  to  include  a  hydrophobic 
transmembrane  segment  should  enhance  the  apparent  dimerization  affinity  of  the  ECDs 
even  in  the  absence  of  specific  TM  interactions.  The  TM  domain  would  be  restricted  to 
the  volume  of  the  micellar  phase,  and  thus  the  apparent  free  energy  of  association  of 
the  ECDs  would  now  contain  a  contribution  from  the  statistical  energy  term  described 
above  as  well  as  the  intrinsic  free  energy  of  dimerization  of  the  soluble  domains,  AGint. 
Applying  the  above  formalism,  we  can  substitute  AGint,  for  AG"  and  rearranging 
equation  [5]  to  write  the  apparent  association  equilibrium  constant  for  an  ECD-TM 
construct  in  micelles,  Kapp,  as: 

K°pp  =  [micellar  Det Jw  ^ 

The  equilibrium  constants  in  equation  [6]  are  in  bulk  molar  concentration  units. 
From  eqn  [6],  one  would  expect  that  the  association  of  the  receptor  would  depend  on 
detergent  concentration,  with  a  tighter  apparent  association  expected  at  lower  detergent 
concentrations,  and  it  has  in  fact  been  shown  that  the  full-length  EGFR,  as  well  as  a 
truncated  fragment  resembling  v-erbB,  interact  more  strongly  at  low  detergent 
concentrations  [22,  23], 

A  study  by  Tanner  &  Kyte,  which  compared  the  ligand-induced  dimerization  of  the 
ECD  of  erbBI  to  the  dimerization  of  a  fragment  extended  to  include  the  TM,  is  also 
consistent  with  this  formalism.  This  study  found  the  longer  fragment  did  indeed 
associate  more  readily,  although  complete  dimerization  was  never  observed  [24],  The 
dimerization  of  the  longer  fragment  was  measured  in  1%  Triton  X-100  (15mM),  and 
equation  [6]  predicts  that  restriction  of  the  soluble  fragment  to  the  micellar  phase  would 
lead  to  an  apparent  67-fold  enhancement  in  the  association.  By  comparing  the 
dimerization  of  the  ECD  alone,  which  had  been  shown  to  be  complete  at  40pM  protein 
[25],  with  the  dimerization  of  the  ECD-TM  fragment  in  Triton-micelles  that  they  observed 
(2nM),  the  authors  estimated  an  observed  enhancement  of  approximately  10,000-fold 
[24],  However,  a  subsequent  study  reports  complete  ligand-induced  dimerization  of  the 
extracellular  domain  of  erbBI  at  concentrations  as  low  as  4pM  protein  [6],  so  the 
enhancement  of  ligand-induced  dimerization  of  the  ECD  by  the  addition  of  the  TM  may 
be  less  than  originally  suggested  and  be  more  on  the  order  of  1,000-fold.  Equation  [6] 
suggests  a  large  portion  of  this  is  simply  the  result  of  restricting  the  longer  fragment  to 
be  stably  inserted  in  the  micelles.  If  approximately  65-fold  arises  from  limiting  the 
protein  to  the  micellar  phase,  only  a  1 5-30  fold  enhancement  arises  from  other  features 
of  the  ECD-TM  construct.  This  additional  enhancement  is  modest,  and  it  could  arise 
from  stabilization  in  the  longer  construct  of  structures  in  the  juxtamembrane  region 
important  for  dimerization  or  from  restriction  of  the  relative  orientation  of  two  ECDs  on 
the  surface  of  a  micelle.  The  role  of  the  transmembrane  domain  in  the  differences 
observed  in  the  Tanner  and  Kyte  study  is  still  unclear,  but  our  results  suggest  that  it 
may  not  be  necessary  to  invoke  strong  specific  interactions  between  the  TM  domains  to 
account  for  the  observed  enhancement. 
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3.  Development  of  protocols  for  evaluating  transmembrane  protein-protein 
interactions  in  lipid  bilayers  (Task  2,  Part  C) 

In  light  of  the  modest  interactions  we  observed  for  the  erbB  transmembrane 
domain  interactions  in  detergent  micelles,  we  find  it  compelling  to  develop  experimental 
approaches  for  the  analysis  of  membrane  protein  interactions  in  lipid  bilayers.  A  critical 
step  in  this  analysis  is  reconstitution  of  the  membrane  protein  in  the  lipid  bilayer.  This 
can  be  most  easily  carried  out  by  resuspending  the  membrane  protein  +  lipid  mixture 
from  organic  solvents  into  an  aqueous  buffer  of  interest.  The  use  of  the  SN-erbB  fusion 
protein  is  not  ideal  for  this  approach,  and  so  we  have  begun  studies  using  a  synthetic 
peptide  corresponding  to  the  transmembrane  domain  of  the  rat  neu  homologue.  We 
chose  to  make  this  peptide  first,  since  it  had  been  previously  indicated  in  the  literature 
that  this  sequence  was  amenable  to  chemical  synthesis  using  fmoc  approaches.  Mass 
spectrometry  analyses  are  shown  in  Figure  5  for  the  crude  and  purified  peptide. 

We  have  carried  out  a  preliminary  sedimentation  equilibrium  experiment  on  this 
peptide,  which  confirms  that  the  self-association  in  micelles  is  weak,  and  we  are 
currently  working  on  reconstitution  conditions  into  bilayer. 
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Figure  5.  Mass  spec  analysis  of  crude  (top)  and  HPLC  purified  (bottom)  neu  transmembrane 
peptide.  The  mass  of  -3910  corresponds  to  the  35-mer  sequence  that  we  successfully  synthesized. 
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4.  Plans  for  the  fourth  funding  year  of  this  award  (Career  Award  only). 

1 .  In  the  coming  year  we  are  focused  on  developing  experimental  methods  for 
measuring  membrane  protein  interactions  in  lipid  bilayers  (Task  2  Part  C).  We 
have  recently  purchased  a  fluorometer  (Installation  anticipated  July  5,  2005), 
and  we  are  poised  to  carry  out  these  studies. 

5.  Abbreviations  Used: 

SN:  Staphylococcal  nuclease 

GpA:  glycophorin  A 

SDS-PAGE:  Sodium  dodecyl  sulfate  polyacrylamide  gel  electrophoresis 

IPTG  Isopropyl-D-thiogalactopyranoside 

TM  Transmembrane 

6.  Reportable  Outcomes 

Three  manuscripts  were  published  in  the  last  granting  period. 

FJ  Kobus  &  KG  Fleming  (2005)  “The  GxxxG-containing  transmembrane  domain  of 
the  CCK-4  oncogene  does  not  encode  preferential  self-interactions"  Biochemistry  44: 
1464-1470. 

AM  Stanley  &  KG  Fleming  (2005)  “The  transmembrane  domains  of  erbB  receptors 
do  not  dimerize  strongly  in  micelles”  J.  Mol.  Biol.  347:  759-72. 

KG  Fleming  (2005)  “Analysis  of  membrane  proteins  using  analytical 
ultracentrifugation”  (Invited  Book  Chapter)  In  Press.  Full  bibliographical  information  not 
yet  available. 

7.  Conclusions. 

Our  results  represent  the  first  quantitative  measurement  of  the  self-association 
propensity  for  these  erbB  receptor  transmembrane  domains.  All  erbB  transmembrane 
domains  drive  self-association  of  the  SN-chimeric  construct  in  11  mM  CsEs  micelles.  If 
the  concentration  of  detergent  micelles  is  increased,  the  proteins  are  effectively  diluted. 
In  23  mM  C8E5,  the  SNerbBI,  SNerbB2  and  SNerbB3  proteins  are  monomeric  whereas 
the  SNerbB4  protein  still  shows  some  level  of  association  under  this  condition.  Upon 
increasing  the  detergent  concentration  further  to  50  mM  CsEs,  the  SNerbB4  protein  is 
completely  monomeric.  We  developed  a  theoretical  model  to  explain  the  expected 
random  distribution  of  oligomers  in  micelles,  and  we  find  that  the  SN-erbBI,  SN-erbB2 
and  SN-erbB3  species  distributions  are  well  described  by  this  model  whereas  the  SN- 
erbB4  distribution  shows  a  slight  amount  of  thermodynamically  meaningful  preferential 
association.  We  estimate  the  free  energy  of  an  preferential  interactions  to  be  on  the 
order  of  1  kcal  mol'1  or  less. 

We  applied  our  thermodynamic  model  to  analyze  the  previously  published  data 
of  Tanner  &  Kyte  [24],  We  find  that  the  covalent  attachment  of  a  transmembrane 
domain  to  a  soluble  (ligand-binding)  domain,  can  enhance  the  interactions  of  the  soluble 
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domain.  Thus,  strong  preferential  interactions  between  transmembrane  domains  are  not 
required.  We  propose  that  interactions  between  the  transmembrane  domains  need  only 
be  sterically  permissible,  and  that  the  full-length  receptor  may  have  many  sites  of  weak 
interactions. 
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abstract:  The  recently  cloned  colon  carcinoma  kinase  4  (CCK4)  oncogene  contains  an  evolutionarily 
conserved  GxxxG  motif  in  its  single  transmembrane  domain  (TMD).  It  has  previously  been  suggested 
that  this  pairwise  glycine  motif  may  provide  a  strong  driving  force  for  transmembrane  helix— helix 
interactions.  Since  CCK4  is  thought  to  represent  a  new  member  of  the  receptor  tyrosine  kinase  family, 
interactions  between  the  TMDs  may  be  important  in  receptor  self-association  and  activation  of  signal 
transduction  pathways.  To  determine  whether  this  conserved  CCK4  TMD  can  drive  protein— protein 
interactions,  we  have  carried  out  a  thermodynamic  study  using  the  TMD  expressed  as  a  Staphylococcal 
nuclease  (SN)  fusion  protein.  Similar  SN— TMD  fusion  proteins  have  been  used  to  determine  the  sequence 
specificity  and  thermodynamics  of  transmembrane  helix— helix  interactions  in  a  number  of  membrane 
proteins,  including  glycophorin  A.  Using  sedimentation  equilibrium  in  C14  betaine  micelles,  we  discovered 
that  the  CCK4  TMD  is  unable  to  drive  strong  protein— protein  interactions.  At  high  protein/detergent 
ratios,  the  SN— CCK4  fusion  protein  will  dimerize,  but  a  stochastic  model  for  protein  association  in  micelles 
can  explain  the  observed  dimer  population.  For  low-affinity  interactions  such  as  the  one  studied  here,  an 
understanding  of  this  discrete  stochastic  distribution  of  membrane  proteins  in  micelles  is  important  for 
distinguishing  between  preferential  and  random  self-interactions,  which  can  both  influence  the  oligomeric 
population.  The  lack  of  a  thermodynamically  meaningful  self-association  propensity  for  the  CCK4  TMDs 
demonstrates  that  a  GxxxG  motif  is  not  sufficient  to  drive  transmembrane  helix— helix  interactions. 


Receptor  tyrosine  kinases  (RTKs)1  are  known  to  play  an 
important  role  in  the  development  and/or  progression  of 
many  forms  of  cancer.  A  molecular  understanding  of  the 
mechanisms  by  which  these  receptors  promote  cellular 
transformation  is  critical  for  the  understanding  and  treatment 
of  human  cancer.  In  normal  cells,  RTKs  are  transmembrane 
signaling  proteins  that  transmit  biological  signals  from  the 
extracellular  environment  to  the  interior  of  the  cell.  The 
regulated  transmission  of  these  signals  is  important  for  many 
cellular  events,  such  as  cell  growth  and  differentiation,  axonal 
growth,  epithelial  growth  and  development  (/).  RTKs  are 
misregulated  in  many  human  cancers  including  breast  cancer 
(erbB2/Her2)  (2, 3),  ovarian  cancer  (erbB2)  (2, 3),  melanoma 
(CCK4)  (4),  and  colon  cancer  (erbBl  and  CCK4)  (5,  6).  The 
predominant  RTK-related  genetic  alteration  in  human  cancer 
is  not  mutation  of  the  receptor  protein,  but  rather  it  is 
amplification  of  the  gene.  Subsequent  overexpression  of 
receptor  proteins  leads  to  constitutive  stimulation  of  the  RTK 
activity  and  uncontrolled  cellular  signaling. 
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Recently,  colon  carcinoma  kinase  4  (CCK4),  a  new 
member  of  the  receptor  tyrosine  kinase  superfamily  was 
cloned  from  human  colon  carcinoma-derived  cell  lines  (7). 
The  CCK4  mRNA  for  this  gene  was  variably  expressed  in 
colon  carcinoma  derived  cell  lines  but  not  expressed  in 
human  adult  colon  tissues.  In  contrast,  it  is  expressed  in  fetal 
colon  of  the  mouse  (7).  These  observations  suggest  that  the 
normal  role  for  CCK4  may  be  in  development  of  the  colon 
followed  by  down  regulation  in  the  adult  tissue.  The  unusual 
expression  of  CCK4  in  colon  carcinoma  cell  lines  might 
indicate  that  the  protein  is  functioning  abnormally  in  adult 
colon  cells,  which  suggests  a  role  in  colon  carcinoma 
development  and/or  proliferation. 

Orthologues  for  CCK4  have  been  cloned  from  other 
species  (8—10).  Amino  acid  residues  within  the  transmem¬ 
brane  domain  (TMD)  are  among  the  conserved  features  (9). 
The  CCK4  TMD  is  55%  identical  to  the  transmembrane 
sequences  of  the  chicken  Klg  and  Hydra  Lemon  orthologues 
(9),  and  a  pattern  of  residues  with  helical  periodicity  is 
conserved.  Within  the  conserved  residues  of  the  TMD  is  a 
GxxxG  sequence,  a  motif  thought  to  promote  interactions 
between  transmembrane  helices  (11,  12).  The  evolutionary 
conservation  of  the  CCK4  transmembrane  sequence  suggests 
the  hypothesis  that  it  may  play  a  functionally  important  role 
in  signaling  that  involves  transmembrane  helix— helix  inter¬ 
actions.  To  test  whether  this  CCK4  TMD  may  serve  as  a 
driving  force  for  protein— protein  interactions,  we  carried  out 
a  thermodynamic  study  of  the  TMD  using  sedimentation 
equilibrium  in  detergent  micelles. 
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Table  1 :  Transmembrane  Sequences  Used  in  This  Study" 

Protein  SN-TM  fusion  sequence  aligned  by  the  GxxxG  motif 

CCK4S  QLAVHHFSEPG  IGLSVGAAVAYI IAVLGLMFYSKKR 

CCK4xs  qlavhhfsepgmiqtiGlsvGaavayiiavlglmfyskkr 

★  ★  ★  *  *  *  *  *  *  ***  Je  -k 

GpA WT  qlavhhfsepeitliifGvmaGvigtillisygirrli 


"  Sequences  are  aligned  by  the  GxxxG  motif.  The  predicted  transmembrane  domains  are  underlined.  The  N-terminal  sequence  shows  the  junction 
between  the  SN  sequence  and  the  cloned  transmembrane  domain  of  interest.  The  conserved  residues  in  the  transmembrane  domain  are  denoted  with 
a  star.  In  a  manner  analogous  to  the  GpA  construct,  positive  charges  were  added  to  the  C-termini  of  the  CCK4  sequences.  The  SN— CCK4xs 
contains  the  extended  linker  region. 


EXPERIMENTAL  METHODS 

Cloning  and  Sample  Preparation.  An  image  clone  for  the 
CCK4  protein  was  purchased  from  Invitrogen.  The  sequence 
encoding  the  transmembrane  region  was  amplified  using  the 
polymerase  chain  reaction  using  primers  containing  unique 
Smal  and  BamH  I  sites,  and  the  PCR  product  was  cloned 
C-terminal  to  Staphylococcal  nuclease  (SN)  in  an  IPTG- 
inducible  pETlla  expression  construct  from  which  the  open 
reading  frame  for  the  glycophorin  A  transmembrane  had  been 
excised.  The  DNA  sequence  was  verified  by  double-stranded 
DNA  sequencing.  The  Cys-to-Ser  mutation  (immediately 
before  the  KKR)  was  introduced  using  the  QuickChange 
mutagenesis  kit.  The  SN-CCK4s  and  SN-CCK4xs  proteins 
were  purified  using  the  published  protocol  for  the  SN-GpA 
proteins  (13).  Immediately  before  sedimentation  equilibrium 
analysis,  samples  were  exchanged  by  ion-exchange  chro¬ 
matography  into  buffer  containing  10  mM  C14  betaine. 
SigmaUltra  grade  C14  betaine  was  purchased  from  Sigma. 

Sedimentation  Equilibrium  Analytical  Ultracentrifugation. 
Sedimentation  equilibrium  experiments  were  performed  at 
25  °C  using  a  Beckman  XL-A  analytical  ultracentrifuge  using 
six-sector  cells  equipped  with  quartz  windows  in  an  AnTi 
60  rotor  as  described  previously  in  detail  (13—15).  The  sam¬ 
ples  were  centrifuged  for  lengths  of  time  sufficient  to  achieve 
equilibrium  determined  by  using  the  MATCH  algorithm. 
Equilibrium  data  obtained  from  absorbance  at  280  nm  were 
analyzed  by  nonlinear  least-squares  curve  fitting  of  radial 
concentration  profiles  with  the  Macintosh  version  of 
NONLIN  (16)  using  the  equations  describing  the  reversible 
association  in  sedimentation  equilibrium.  For  each  global  fit, 
nine  data  sets  were  used  consisting  of  three  different  initial 
protein  concentrations  (initial  A280  absorbance  values  of  0.9, 
0.6,  and  0.3  cm-1)  analyzed  at  three  significantly  different 
speeds  (20000,  24500,  30000  revolutions  per  minute).  The 
monomeric  molecular  masses  and  partial  specific  volumes 
were  calculated  using  the  program  SEDNTERP  (17),  and 
these  parameters  were  held  constant  in  fitting  the  absorbance 
versus  radius  profiles.  The  equilibrium  constant  was  con¬ 
verted  from  absorbance  to  (bulk  molar)  units  using  molar 
extinction  coefficients  of  20952  and  21425  cm-1  M-1  for 
the  SN-CCK4s  and  SN— CCK4xs  proteins,  respectively. 
These  values  were  calculated  by  adjusting  the  experimental 
molar  extinction  coefficient  for  SN  for  the  additional  mass 
of  the  TMD  of  interest  as  has  been  previously  done  for  the 
glycophorin  A  transmembrane  domain  (18). 


The  apparent  free  energy  of  association  was  calculated 
from  the  apparent  monomer— dimer  equilibrium  constant  in 
the  standard  way,  AGApp  =  —  RT  In  KApp.  The  standard 
state  mole  fraction  free  energy  of  association  was  calculated 
using  the  ideal-dilute  assumption  as  AG°  =  —  RT  In 
where  Kx  =  XApp[micellarC  1 4]w  and  [micellarC14]w  is  the 
concentration  of  micellar  C14  betaine  expressed  on  the 
aqueous  molar  scale  (14,  15). 

Sequence  Motifs.  Sequence  motifs  were  identified 
using  the  online  TMSTAT  calculation  (11)  (http:// 
bioinfo.mbb.yale.edu/tmstat/)  with  the  extended  version  of 
the  CCK4  TMD. 

RESULTS 

We  expressed  the  CCK4  TMD  as  a  C-terminal  fusion 
protein  with  Staphylococcal  nuclease  (SN),  an  experimental 
approach  that  has  been  successfully  used  to  map  the  helix- 
helix  interactions  of  glycophorin  A  (13—15, 19—22),  BNIP-3 
(23),  and  phospholamban  (24)  TMDs.  We  have  previously 
shown  that  this  method  using  this  type  of  SN  fusion  construct 
has  resulted  in  a  quantitative  ranking  of  glycophorin  A 
transmembrane  mutants  by  their  ability  to  dimerize  (13,  21, 
22),  and  the  relative  stabilities  of  a  series  of  GpA  alanine 
mutants  is  conserved  between  the  micellar  environment  in 
vitro  and  bacterial  membranes  in  vivo  (13).  The  advantage 
of  using  the  sedimentation  equilibrium  method  is  that  the 
molecular  mass  and  thus  stoichiometries  are  determined.  In 
addition,  in  the  absence  of  protein— protein  association, 
sedimentation  equilibrium  will  still  result  in  a  positive, 
interpretable  result  since  it  will  directly  report  the  monomeric 
molecular  weights. 

The  CCK4  GxxxG-Containing  Transmembrane  Domain 
Self-Associates  Only  Weakly.  Table  1  shows  the  CCK4 
transmembrane  sequences  considered  in  this  study.  Since  the 
GxxxG  motif  in  CCK4  is  located  closer  to  the  predicted 
N-terminus  of  its  TMD,  we  were  concerned  that  self¬ 
association  might  be  sterically  hindered  by  the  SN  portion 
of  the  fusion  protein.  Therefore,  we  extended  the  linker 
region  by  adding  four  additional  residues  from  the  CCK4 
open  reading  frame.  In  addition,  the  C-terminal  end  of  the 
wild-type  CCK4  TMD  contains  a  cysteine  residue,  which 
we  mutated  to  serine.  For  comparison,  the  sequence  of  the 
glycophorin  A  transmembrane  WT  TMDs  is  also  shown.  We 
first  used  SDS— PAGE  to  screen  the  purified  proteins  for 
the  ability  of  the  CCK4  TMD  to  drive  strong  self-association. 
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This  method  demonstrated  that  both  of  the  SN— CCK4  fusion 
proteins  migrate  as  a  monomeric  species  on  SDS— PAGE. 
In  contrast,  the  SN— GpA  fusion  protein  migrates  predomi¬ 
nantly  as  a  dimer  under  those  same  conditions  (data  not 
shown). 

To  further  test  for  a  self-association  propensity  of  the 
CCK4  TMD,  we  carried  out  sedimentation  equilibrium 
analytical  ultracentrifugation  experiments.  Sedimentation 
equilibrium  allows  detection  of  protein— protein  interactions 
over  a  range  of  association  strengths.  In  particular,  we  have 
detected  significant  dimer  populations  of  the  SN-GpA  mutant 
proteins  even  when  these  proteins  are  monomeric  as  mea¬ 
sured  by  SDS— PAGE  (13,  18,  21,  22).  In  addition,  unlike 
SDS— PAGE,  sedimentation  equilibrium  experiments  can 
provide  a  direct  measure  of  the  equilibrium  constant  for  a 
self-association  reaction.  We  chose  to  use  C14  betaine 
micelles  since  this  detergent  micelle  environment  has  been 
used  for  the  analysis  of  several  designed  transmembrane 
helices  (25)  along  with  the  fact  that  the  glycophorin  A  TMD 
dimer  shows  in  ideal-dilute  behavior  in  these  micelles  (75). 

Sedimentation  equilibrium  experiments  carried  out  in  10 
mM  Cl 4  betaine  detergent  micelles  suggest  that  the  SN- 
CCK4s  protein  can  self-associate,  albeit  weakly.  In  four 
independent  experiments,  the  single  ideal  species  fit  returned 
a  molecular  weight  value  that  is  approximately  20%  greater 
than  the  calculated  value  for  the  monomeric  protein,  which 
suggests  the  formation  of  higher  order  species.  To  determine 
the  molecular  weights  of  these  self-associated  forms,  the  data 
were  fit  to  association  models  including  dimers,  trimers,  or 
tetramers,  In  three  of  four  of  the  experiments,  the  monomer- 
dimer  fit  is  the  best  description  of  the  data.  Figure  1  shows 
a  typical  species  distribution  for  this  monomer-dimer  fit, 
where  it  can  be  seen  that  approximately  20%  of  the  SN- 
CCK4s  protein  is  dimeric.  In  the  fourth  independent  experi¬ 
ment,  a  monomer- trimer  fit  was  equally  as  good  as  a 
monomer- dimer  fit;  however,  the  dimerization  constant  for 
the  monomer— dimer  fit  of  these  data  was  consistent  with 
the  other  three  data  sets  and  we  therefore  used  this  value. 

The  average  apparent  free  energy  of  association  deter¬ 
mined  in  10  mM  C14  betaine  micelles  at  25  °C,  A G^pp, 
equals  —5.2  (±0.1)  kcal  mol-1  (TV  =  4).  Under  the  ideal- 
dilute  assumption  (14),  this  apparent  free  energy  would 
correspond  to  a  standard  state  mole  fraction  association  free 
energy,  AG°,  equal  to  —2.40  (±0.1)  kcal  mol-1.  This  value 
is  considerably  weaker  than  that  of  the  SN— GpA  TMD  in 
either  Cl 4  betaine  micelles,  AG°  =  -5.8  kcal  mol-1  (15) 
or  CsEj  micelles,  AG°  =  -7.0  kcal  mol-1  (14).  Figure  2 
shows  a  comparison  of  the  dimer  distributions  for  SN- 
CCK4s  and  SN— GpA  calculated  from  the  standard  state  free 
energy  values  in  C14  betaine  micelles  where  it  can  be  seen 
that  this  free  energy  difference  leads  to  significant  shifts  in 
the  concentration  range  over  which  the  protein  population 
is  predominantly  dimeric.  At  a  total  protein  concentration 
of  10  juM,  the  SN— GpA  fusion  protein  is  >80%  dimeric  in 
10  mM  C14  betaine  micelles.  In  contrast,  the  SN— CCK4s 
protein  is  ~10%  dimeric  at  this  same  overall  protein 
concentration  in  10  mM  C14  betaine  micelles. 

Because  of  the  observation  that  the  self-interactions  were 
so  weak  for  this  first  SN-CCK4  construct,  we  also  carried 
out  sedimentation  equilibrium  experiments  using  the  SN— 
CCK4xs  fusion  protein  containing  an  extended  linker  region 
between  SN  and  the  TMD.  The  results  for  four  independent 


Radius,  cm 

Figure  1 :  SN— CCK4s  species  distribution  in  C 14  betaine  micelles. 
The  lower  panel  shows  a  representative  data  set  for  43  pM  (bulk) 
SN— CCK4s  collected  in  10  mM  C14  betaine  at  20000  rpm.  The 
open  circles  are  the  data,  and  the  line  shows  the  global  fit  resulting 
from  the  global  fit  of  nine  data  sets  for  this  experiment.  The  broken 
lines  represent  the  monomer  and  dimer  contributions  as  determined 
from  the  global  analysis,  which  sum  to  equal  the  fit.  The  upper 
panel  shows  the  residuals  of  the  fit  for  this  data  set,  which  are 
small  and  random. 


sedimentation  equilibrium  experiments  on  the  SN— CCK4xs 
protein  were  within  error  of  the  original  SN— CCK4s  fusion 
protein.  The  average  apparent  free  energy  of  association, 
AG^pp,  was  equal  to  —5.3  (±0.2)  kcal  mol-1  (TV  =  4)  in  10 
mM  C14  betaine  micelles.  Under  the  ideal-dilute  assumption 
(14),  this  apparent  free  energy  would  correspond  to  a 
standard  state  mole  fraction  association  free  energy,  AG°, 
equal  to  —2.50  (±0.2)  kcal  mol-1.  Thus,  extending  the  linker 
region  of  the  CCK4  TMD  using  native  sequence  resulted  in 
no  enhancement  of  protein— protein  interactions. 

DISCUSSION 

To  determine  whether  the  GxxxG-containing  TMD  of 
CCK4  encodes  the  potential  for  strong  helix— helix  self¬ 
association,  we  carried  out  a  thermodynamic  study  using  an 
SN— CC.K4  TMD  fusion  construct.  This  type  of  construct 
has  been  used  in  the  past  to  map  the  sequence  dependence 
of  helix— helix  oligomerization  in  several  transmembrane 
proteins,  most  notably  the  TMD  of  glycophorin  A  (13, 19— 
21).  Using  both  SDS— PAGE  and  sedimentation  equilibrium 
in  detergent  micelles,  we  find  that  the  SN— CCK4  protein 
shows  a  very  weak  propensity  for  self-association. 

The  SN—CCK4  Species  Distribution  Is  Well  Described  by 
a  Random  Distribution  of  Proteins  in  Micelles.  To  observe 
SN— CCK4  interactions,  the  current  experiments  were  carried 
out  at  protein/detergent  ratios  much  higher  than  were 
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Figure  2:  CCK4  is  destabilized  with  respect  to  giycophorin  A.  A 
comparison  of  the  dimer  species  distribution  between  the  SN— GpA 
and  SN— CCK4  transmembrane  proteins  in  10  mM  C14  betaine 
micelles.  The  thin  curves  are  calculated  based  on  the  free  energies 
of  association.  The  thickened  portion  of  the  SN— CCK4  curve 
represents  the  region  of  the  data  that  was  observed  in  the 
sedimentation  equilibrium  experiments.  The  less  favorable  free 
energy  of  association  for  SN— CCK4  shifts  the  population  of  dimer 
to  higher  protein  concentrations. 

previously  used  for  giycophorin  A  (15),  and  at  the  highest 
SN— CCK4  protein  concentration  the  number  of  protein 
molecules  approached  the  number  of  micelles.  Since  the 
fundamental  measurement  in  the  sedimentation  equilibrium 
under  the  current  experimental  conditions  is  the  protein  mass, 
two  protein  molecules  that  occupy  the  same  micelle  will 
sediment  together  and  be  measured  as  a  dimer  whether  or 
not  there  is  a  preferential  interaction  between  them.  This  co¬ 
partitioning  phenomenon  has  been  referred  to  in  the  literature 
as  “artifactual  togetherness”  (26)  and  should  only  occur  at 
high  protein  concentrations  coupled  with  low  detergent 
micelle  concentrations.  We  hypothesize  that  this  co-partition¬ 
ing  is  due  to  an  excluded  volume  effect  and  that  these 
conditions  can  be  thought  of  as  thermodynamically  nonideal. 
A  similar  excluded  volume  effect  has  been  indicated  to  occur 
in  soluble  proteins  at  high  protein  concentrations  or  in  the 
presence  of  small  molecules  that  induce  crowding  conditions 
(27).  To  determine  the  extent  of  thermodynamically  mean¬ 
ingful  preferential  interactions  between  the  SN— CCK4 
transmembrane  proteins,  it  is  first  necessary  to  distinguish 
them  from  the  protein  interactions  induced  by  the  limiting 
solvent  conditions. 

To  develop  a  model  describing  the  distribution  of  non¬ 
interacting  proteins  in  micelles,  we  employed  a  statistical 
approach  that  considers  distribution  of  proteins  in  micelles 
in  terms  of  the  random  occupancy  of  placing  balls  in  bins, 
where  the  protein  molecules  represent  balls  and  the  individual 
micelles  represent  bins.  Using  this  formalism,  the  random 
occupancy  of  any  particular  micelle  by  any  particular  protein 
molecule  can  be  described  as  a  Bernoulli  event  (e.g.,  either 
the  micelle  contains  that  protein  or  not),  which  can  be 
described  by  a  binomial  distribution: 

Pr[Xf  =  *]  =  ("' )p*(l  -p)'"-k  (1) 
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where  X,  is  the  random  variable  that  counts  the  number  of 
proteins  in  a  particular  micelle,  i,  k  is  the  number  of  proteins 
occupying  a  micelle  (e.g.,  the  “oligomeric  state”),  m  is  the 
number  of  protein  molecules,  p  is  the  probability  of  each 
Bernouilli  trial,  which  equals  1/n,  and  n  is  the  number  of 
micelles.  When  the  number  of  proteins  and  the  number  of 
micelles  both  exceed  the  allowed  occupancy  of  a  particular 
micelle,  k,  this  binomial  distribution  is  approximated  by  the 
Poisson  distribution: 

lim  (m,n  »  £):Pr[X(.  =  k]  %  e  mln  (2) 

To  consider  the  statistical  protein  distribution,  we  write  the 
partition  function,  P,  as  the  sum  all  those  probabilities  for 
which  k  >  0.  (e.g.,  for  this  purpose,  we  are  not  interested  in 
the  distribution  of  “empty”  micelles).  To  convert  each 
probability  to  a  relative  concentration,  each  term  is  multiplied 
by  the  number  of  proteins  that  the  probability  represents. 

P  =  ^>Pr[X,  =  k]  (3) 

k>  0 

The  fraction  of  any  particular  species  can  then  be 
calculated  as  a  function  of  the  mole  fraction  protein  (m/ri) 
in  the  micellar  phase  from  this  partition  function  by  dividing 
its  contribution  by  the  sum  of  all  species.  For  example,  the 
fraction  dimer  can  be  calculated  as 

2Pr[X,  =  2] 

/Dimer  =  (4) 

^kPr[X,  =  k\ 

k>  0 

The  exact  form  of  the  partition  function  and  the  distribu¬ 
tion  of  species  will  depend  on  the  allowed  values  of  k  in  the 
model.  In  this  study,  we  limited  the  species  to  monomer  and 
dimer.  In  addition,  we  postulate  that  an  unlimited  number 
of  proteins  in  a  micelle  would  not  be  physically  possible. 
Experimentally,  this  would  behave  as  an  aggregated  protein, 
and  we  approximated  this  as  a  micelle  containing  10 
monomers.  While  we  did  not  observe  any  aggregation  in 
our  sedimentation  equilibrium  experiments,  we  wished  to 
calculate  the  statistical  distribution  over  wide  range  of  mole 
fraction  values.  Including  this  parameter  in  the  Poisson  model 
allows  us  to  approximate  when  this  might  occur,  and  by 
including  this  term  in  the  partition  function,  we  confirmed 
that  decamer  is  not  a  significant  species  within  the  concen¬ 
tration  range  shown  in  Figure  3  (data  not  shown).  Thus,  for 
clarity,  we  subsequently  removed  this  term,  and  the  final 
partition  function  contains  only  monomeric  and  dimeric 
species. 

A  final  parameter  in  this  statistical  model  is  required  to 
convert  micelles  to  total  micellar  detergent  concentration, 
where  the  latter  is  the  experimental  variable.  For  protein- 
free  micelles,  we  assumed  an  aggregation  number  for  C14 
betaine  micelles  of  100,  which  lies  intermediate  between  the 
published  values  for  C12  betaine  and  Cl 6  betaine  micelles 
(28,  29).  For  protein-containing  micelles,  we  recognize  that 
the  number  of  detergents  bound  may  or  may  not  be  similar 
to  the  protein-free  aggregation  number,  and  it  may  also  be 
that  the  monomeric  and  dimeric  forms  of  the  protein  would 
be  solvated  by  a  micelle  of  a  different  size.  Under  the 
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The  epidermal  growth  factor  receptors  (erbB)  constitute  an  important  class 
of  single  pass  transmembrane  receptors  involved  in  the  transduction  of 
signals  important  for  cell  proliferation  and  differentiation.  Receptor 
association  is  a  key  event  in  the  signal  transduction  process,  but  the 
molecular  basis  of  this  interaction  is  not  fully  understood.  Previous 
biochemical  and  genetic  studies  have  suggested  that  the  single  trans¬ 
membrane  helices  of  these  receptor  proteins  might  play  a  role  in  stabilizing 
the  receptor  complexes.  To  determine  if  the  erbB  transmembrane  domains 
could  provide  a  driving  force  to  stabilize  the  receptor  dimers,  we  carried 
out  a  thermodynamic  study  of  these  domains  expressed  as  C-terminal 
fusion  proteins  with  staphylococcal  nuclease.  Similar  fusion  constructs 
have  been  used  successfully  to  investigate  the  oligomerization  and 
association  thermodynamics  of  a  number  of  transmembrane  sequences, 
including  that  of  glycophorin  A.  Using  SDS-PAGE  analysis  and  sedimen¬ 
tation  equilibrium  analytical  ultracentrifugation,  we  do  not  find  strong, 
specific  homo  or  hetero-interactions  between  the  transmembrane  domains 
of  the  erbB  receptors  in  micellar  solutions.  Our  results  indicate  that  any 
preferential  interactions  between  these  domains  in  micellar  solutions  are 
extremely  modest,  of  the  order  of  lkcalmol-1  or  less.  We  applied  a 
thermodynamic  formalism  to  assess  the  effect  of  weakly  interacting  TM 
segments  on  the  behavior  of  a  covalently  attached  soluble  domain.  In  the 
case  of  the  ligand-bound  EGFR  ectodomain,  we  find  that  restriction  of  the 
ectodomain  to  the  micellar  phase  by  a  hydrophobic  TM,  even  in  the 
absence  of  strong  specific  interactions,  is  largely  sufficient  to  account  for  the 
previously  reported  increase  in  dimerization  affinity. 

©  2005  Elsevier  Ltd.  All  rights  reserved. 
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Introduction 

Transmission  of  molecular  signals  across  biologi¬ 
cal  membranes  is  essential  for  normal  cellular 
function.  The  epidermal  growth  factor  receptors 
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represent  an  essential  class  of  receptor  tyrosine 
kinases  (RTKs)  involved  in  the  transduction  of 
signals  important  for  cell  proliferation  and  differ¬ 
entiation.  In  humans,  this  family  includes  four 
receptor  proteins:  the  epidermal  growth  factor 
receptor  (EGFR,  erbBl,  HER1),  erbB2  (HER2), 
erbB3  (HER3),  and  erbB4  (HER4).  The  generally 
accepted  paradigm  for  signaling  by  these  receptors 
is  a  ligand-induced  rearrangement  of  the  extra¬ 
cellular  domains  followed  by  receptor  association. 
The  subsequent  trans-phosphorylation  of  the  intra¬ 
cellular  kinase  domains  results  in  downstream 
signaling.1,2  The  nature  of  the  signal  is  determined 
by  the  specific  identity  of  the  complex;  these 
receptors  are  able  interact  with  a  number  of 
hormone  ligands  to  form  a  variety  of  homo  and 
hetero-oligomeric  complexes.3  This  diversity  of 
interactions  yields  a  vast  signaling  network,  and 
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unregulated  signaling  within  the  network  is  impli¬ 
cated  in  a  number  of  human  cancers.4 

Discovery  of  the  molecular  basis  behind  the 
protein  association  reaction  in  the  erbB  receptor 
complexes  is  critical  to  understanding  the  signal 
transduction  process.  Recent  structures  of  the 
ligand-binding  domains  of  the  erbB  receptors 
have  begun  to  provide  insight  into  the  mechanisms 
of  signaling.  With  the  exception  of  erbB2,  it  is  now 
clear  that  the  extracellular  domains  (ECD)  of  these 
receptors  exist  in  an  auto-inhibited  conformation  in 
the  absence  of  ligand.  Instead  of  bridging  two 
receptors,  ligand-binding  induces  a  conformational 
change  in  the  ECD  that  reveals  a  cryptic  protein- 
protein  interface.5  In  contrast,  the  ECD  of  ErbB2,  for 
which  there  is  no  known  ligand,  exists  in  an  already 
extended  conformation  reminiscent  of  the  structure 
induced  in  the  ECD  of  the  other  erbBs  by  ligand¬ 
binding.  Thus,  the  recent  structures  revealed  that  an 
interface  between  any  two  erbB  proteins  must  be 
entirely  receptor-mediated.5 

While  the  crystal  structures  of  the  extracellular 
portions  of  the  erbB  proteins  revealed  a  receptor- 
receptor  interface,  the  extracellular  ligand-binding 
domains  alone  are  not  capable  of  recapitulating  all 
of  the  erbB  interactions  that  are  known  to  exist.  In 
some  cases,  the  ligand-binding  domain  is  not  even 
necessary  to  form  an  active  complex.  The  v-erbB 
oncogene  product  is  a  truncated  homolog  of  erbBl 
that  lacks  most  of  its  ligand-binding  domain,  but  is 
capable  of  dimerizing  and  transforming  cells  in  a 
ligand-independent  manner.8,9  Additionally,  a 
version  of  erbB2  with  a  truncated  N-terminal 
domain  also  leads  to  more  efficient  transformation 
than  its  full-length  counterpart,  even  though  the 
ECD  is  presumably  not  normally  auto-inhibited.1011 
Interestingly,  the  extracellular  domain  of  erbB2- 
(HER2)  is  proteolytically  cleaved  in  tumor  cells  to 
yield  a  soluble  fragment,  detectable  in  the  serum  of 
breast  cancer  patients,  and  an  active  membrane 
fragment.  The  domain  structure  of  the  membrane 
fragment,  with  just  a  transmembrane  segment  and  a 
tyrosine  kinase  domain,  is  reminiscent  of  the  v-erbB 
oncogene  product.12,13  Together,  these  results 
suggest  that  other  regions  of  the  protein  are 
important  for  stabilizing  and/or  regulating 
receptor-receptor  interactions,  and  it  raises  ques¬ 
tions  about  the  relative  contribution  of  the  single 
transmembrane  helix  and  the  intracellular  kinase 
domain  to  the  receptor  association. 

A  number  of  studies  have  suggested  that  specific 
interactions  between  the  transmembrane  helices  are 
in  fact  important  for  stabilizing  the  receptor  com¬ 
plexes.  Attention  first  focused  on  the  trans¬ 
membrane  domain  as  a  potentially  important  site 
of  interaction  when  the  neu  oncogene  was  identified 
as  a  point  mutation  of  valine  to  glutamic  acid  within 
the  transmembrane  (TM)  of  a  rat  ortholog  of 
erbB2.14  Molecular  modeling  and  solid-state  NMR 
experiments  suggest  the  glutamic  acid  might 
stabilize  receptor  dimers  by  forming  inter-helical 
hydrogen  bonds.15,16  The  transforming  effect  of  the 
neu  glutamic  acid  substitution  was  found  to 


depend  upon  the  surrounding  sequence  context,17 
as  well  as  absolute  position  in  the  TM.18  The 
surrounding  sequence  incorporates  a  five  amino 
acid  residue  motif,  Small-x-x-Large-G/A,  consist¬ 
ing  of  a  small  residue  (Gly,  Ala,  Ser,  Thr,  or  Pro)  in 
the  zero  position,  a  large  aliphatic  residue  (Ala,  Val, 
Leu,  or  lie)  in  position  3,  followed  by  Gly  or  Ala  in 
position  four.1'  This  motif  was  identified  in  a  large 
number  of  receptor  tyrosine  kinases,  suggesting  the 
motif  could  also  be  important  for  normal  receptor 
function.19  Recently,  it  was  postulated  that  the  two 
copies  of  the  sequence  motif  in  erbBl  have  distinct 
functional  roles,  one  being  important  for  homo¬ 
interactions  and  the  other  important  for  hetero¬ 
interactions  with  erbB2.20 

Additional  evidence  for  a  role  for  the  trans¬ 
membrane  segment  in  receptor  association  resulted 
from  a  study  of  the  ligand-induced  dimerization  of 
the  extracellular  domain  of  EGFR  (erbBl).  Extend¬ 
ing  this  fragment  to  include  the  membrane-span¬ 
ning  segment  enhanced  the  level  of  observed 
dimerization  upon  addition  of  the  EGF  ligand.21 
In  other  studies  in  vivo,  peptides  corresponding  to 
the  TM  regions  of  the  protein  are  capable  of 
specifically  inhibiting  their  cognate  receptor.22,23 
Interfering  with  the  transmembrane  domain  can 
also  affect  hetero-dimerization.  Replacement  of  the 
transmembrane  domain  of  erbB3  with  the  TM 
sequence  from  the  fibroblast  growth  factor  receptor 
or  with  a  minimal  lipid-anchor  impairs  hetero¬ 
dimerization  with  erbB2  relative  to  the  wild-type 
erbB3,  suggesting  the  TM  segment  functions  as 
something  more  than  a  membrane  tether.6 

More  recently,  it  was  found  that  all  four  of  the 
erbB  TM  segments  gave  strong  positive  responses 
in  TOXCAT,  a  genetic  assay  for  helix-helix  associ¬ 
ation.24  The  erbB  TM  domains  gave  TOXCAT 
signals  that  were  about  half  the  strength  of  wild- 
type  glycophorin  A  (GpA),  a  well-characterized  TM 
model  system  known  to  have  strong  helix-helix 
interactions.23,24  All  four  erbB  TOXCAT  responses 
were  significantly  stronger  than  that  of  the  dis¬ 
ruptive  GpA  mutant  G83I.24  This  response  level  in 
TOXCAT  can  be  interpreted  as  an  indication  of  a 
high  propensity  for  the  transmembrane  sequence  to 
self-associate  in  bacterial  membranes.  Moreover, 
mutational  analysis  showed  that  changing  residues 
in  the  Sternberg  &  Gullick  motif15,19  and  the  closely 
related  GG4  motifs25  affected  the  response  levels 
observed  in  TOXCAT.24  Surprisingly,  a  valine  to 
glutamic  acid  mutation  in  erbB2,  analogous  to  that 
found  in  rat  neu,  which  is  generally  believed  to 
stabilize  receptor  dimers,  resulted  in  a  reduction  in 
the  TOXCAT  signal.24 

Whereas  TOXCAT  appeared  to  be  reporting  on  a 
sequence-specific  interaction,  several  studies  indi¬ 
cate  active  receptors  are  actually  quite  promiscuous 
in  the  transmembrane  sequences  they  can  tolerate. 
EGFR  can  accommodate  a  number  of  proline 
substitutions  in  the  TM  and  remain  responsive  to 
EGF.26  Several  erbBl  constructs  with  extended 
or  truncated  transmembrane  domains  will  still 
bind  ligand  and  retain  their  kinase  activity.26,27 


Weak  Interactions  of  the  ErbB  TMs 


761 


Additionally,  the  transmembrane  domain  of  the  rat 
neu  receptor  can  be  reduced  to  a  simplified 
sequence  of  polyvaline  with  two  appropriately 
placed  glutamic  acid  residues  without  a  loss  of 
receptor  dimerization  or  activation.28  29  Thus, 
whether  the  transmembrane  domains  of  the  erbB 
receptors  provide  a  strong  and  specific  driving  force 
for  receptor  interaction  remains  an  open  question. 

In  this  study,  we  address  the  intrinsic  propensity 
of  the  erbB  transmembrane  sequences  to  self¬ 
associate  using  a  well-characterized  fusion  protein 
expression  system  coupled  with  sedimentation 
equilibrium  analytical  ultracentrifugation.  We 
employed  a  construct  in  which  the  erbB  trans¬ 
membrane  domains  were  fused  via  a  14  amino  acid 
residue  linker  region  to  the  C  terminus  of  staphy¬ 
lococcal  nuclease  (SN).  Similar  SN-TM  fusion 
constructs  have  been  used  to  successfully  probe 
the  sequence-dependent  oligomerization  of  several 
transmembrane  domains,  including  glycophorin  A, 
bNIP3,  phospholamban,  and  several  designed 
sequences.30' 3  Compared  to  these  helix-helix 
oligomers,  we  find  that  the  transmembrane 
domains  of  the  erbB  receptors  have  only  a  minimal 
propensity  for  interaction  in  micellar  solutions, 
despite  giving  strong  signals  in  TOXCAT. 

Results 

Table  1  shows  the  amino  acid  sequences  for  the 
TM  segments  of  the  four  wild-type  human  erbB 
receptors  that  were  used  in  this  study.  SNerbB2  was 
originally  cloned  without  C-terminal  charges,  but 
subsequent  inclusion  of  these  charges  in  the 
SNerbB2KRR  clone  did  not  significantly  impact 
the  results.  A  construct  in  which  the  rat  neu  valine 
to  glutamic  acid  mutation  was  mapped  onto  the 
human  erbB2  sequence  and  another  construct  with 
a  valine  to  isoleucine  mutation  that  has  been 
associated  with  a  lower  risk  of  cancer34,30  were 
also  cloned  and  expressed. 

Strong  dimerization  cannot  be  detected  in  SDS 
micelles 

The  association  of  the  SNerbB  fusion  constructs 
was  first  analyzed  by  SDS-PAGE.  This  method  can 
detect  stable  oligomeric  complexes  of  membrane 
proteins,  and  it  has  been  used  previously  to  map  the 


Figure  1.  SDS-PAGE  analysis  of  whole  cell  lysates  for 
proteins  expressing  SN  fusion  proteins  using  a  20% 
acrylamide  Phast  gel.  Arrows  indicate  the  position  to 
which  monomers  and  dimers  migrate.  Lane  1,  markers; 
lane  2,  -IPTG  SNGpA;  lane  3,  +IPTG  SNGpA;  lane  4, 
+  IPTG  SNerbBl;  lane  5,  +IPTG  SNerbB2,  lane  6,  +IPTG 
SNerbB3;  lane  7,  +  IPTG  SNerbB4,  lane  8,  markers. 


interfaces  of  several  TM  helix  oligomers.30'32  Since 
SDS  is  a  detergent  micelle  environment,  membrane 
proteins  may  retain  their  structure,  and  some  non- 
covalent  interactions  can  be  preserved.  Indeed,  the 
isolated  transmembrane  domain  from  the  rat 
homolog  of  erbB2,  protoneu,  has  been  shown  to 
retain  a  helical  secondary  structure  in  SDS 
micelles.36 

Figure  1  shows  the  SDS-PAGE  of  whole  cell 
lysates,  which  revealed  that  the  SNerbB  fusion 
proteins  migrated  as  monomers.  For  comparison, 
lysates  from  cells  expressing  SNGpA,  which 
migrates  predominantly  as  a  dimer  on  SDS-PAGE, 
are  also  shown.  Using  purified  protein,  much 
higher  concentrations  (>100  pM)  could  be  loaded 
onto  the  gel,  but  the  protein  still  migrated  as 
monomer  (data  not  shown).  Both  SNerbB2  mutants, 
SNerbB2neu  and  SNerbB2VI,  expressed  as  mono¬ 
mers,  but  the  proteins  were  ill-behaved  and  neither 
could  be  stably  purified  in  sufficient  quantities  for 
thermodynamic  studies. 

SNerbB  proteins  are  predominantly  monomeric 
in  C8E5 

In  the  case  of  the  glycophorin  A  transmembrane 
domain,  the  free  energy  of  the  helix-helix  inter¬ 
actions  is  known  to  be  weaker  in  SDS  relative  to 


Table  1.  A  list  of  the  transmembrane  sequences  considered  in  this  study,  given  by  single-letter  amino  acid  codes 
Clone  name  TM  AA  sequence 


SNerbBl 

SNerbB2 

SNerbB2KRR 

SNerbB2neu 

SNerbB2VI 

SNerbB3 

SNerbB4 


SIATGMVGALLLLLWALGIGLFMRRR 

LTSIVSAVVGILLWVLGVVFGILI 

LTSIVSAWGILLVWLGWFG1LIKRR 

LTSIVSAVEGILLVWLGWFG1LIKRR 

LTSIIVSAWGILLVVVLGVVFGILIKRR 

LTMALTVIAGLWIFMMLGGTFLYWRGRR 

LIAAGVIGGLFILVIVGLTFAVYVRRK 


These  sequences  were  fused  to  the  C-terminal  end  of  the  staphylococcal  nuclease  via  a  linker  region.  Mutations  are  in  bold  and 
underlined. 
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other  detergent  environments.37,38  Previously,  sedi¬ 
mentation  equilibrium  analytical  ultracentrifuga¬ 
tion  has  been  used  to  detect  significant  dimer 
populations  for  mutants  of  GpA  that  were  mono¬ 
meric  by  the  SDS-PAGE  assay.  9  In  contrast  to  SDS- 
PAGE,  analytical  ultracentrifugation  also  allows  for 
equilibrium  measurements  of  the  species  in 
solution  over  a  wide  concentration  range.  This 
approach  has  been  used  to  determine  the  thermo¬ 
dynamics  of  association  of  wild-type 
and  more  than  50  sequence  variants, 
to  more  rigorously  investigate  the  association  of  the 
erbB  transmembrane  segments,  we  carried  out 
sedimentation  equilibrium  experiments  in  the 
neutrally  buoyant  detergent  C8E5. 

The  SNGpA  construct  is  greater  than  90%  dimeric 
in  23  mM  C8E5  at  the  concentrations  used  in  the 
centrifuge,  and  even  the  very  disruptive  GpA 
mutant  G83I  is  greater  than  20%  dimeric  under 
these  conditions.  9  Therefore,  we  expected  that  we 
would  be  able  to  detect  moderately  stable  inter¬ 
actions  of  the  erbB  TMs  at  this  detergent  concen¬ 
tration.  Representative  data  sets  for  each  SNerbB 
protein  are  shown  in  Figure  2.  Using  a  non-linear, 
least-squares  global  analysis  procedure,  the  data  for 
each  of  SNerbBl,  SNerbB2,  and  SNerbB3  were  well 
described  by  a  model  for  a  single  ideal  species  with 


glycophorin  A 
&io  Therefore, 


the  molecular  mass  of  a  monomer.  Only  SNerbB4 
could  not  be  described  by  a  simple  monomer  model 
at  23  mM  C8E5,  as  evidenced  by  the  non-random 
residuals  in  Figure  2.  However,  if  the  concentration 
was  raised  to  50  mM  C8E5,  SNerbB4  was  mono¬ 
meric,  suggesting  the  interactions  in  C8E5  are 
reversible.  In  contrast,  most  GpA  mutants  have  a 
detectable  population  of  dimer  under  these  exper¬ 
imental  conditions.  These  initial  experiments 
suggest  interactions  between  the  erbB  TMs  are 
very  weak,  with  SNerbB4  showing  a  slightly  greater 
propensity  for  association  relative  to  the  other  three. 

At  limiting  protein  to  detergent  ratios,  SNerbB 
proteins  self-associate 

The  apparent  dissociation  constant  for  dimeriza¬ 
tion  of  the  GpA  TM  decreases  as  the  detergent 
concentration  is  decreased,  as  a  result  of  the 
reduced  volume  of  the  hydrophobic  phase  available 
to  the  protein.41  Therefore,  to  populate  the  SNerbB4 
oligomer(s),  as  well  as  possibly  detect  even  weaker 
interactions  for  the  other  erbB  TM  domains,  we 
decreased  the  detergent  concentration  in  the  sedi¬ 
mentation  equilibrium  experiments  to  11  mM,  a 
condition  under  which  the  disruptive  glycophorin 
A  mutant  G83I  would  be  greater  than  50%  dimeric. 


Figure  2.  SNerbB  proteins  are  monomeric  at  most  detergent  concentrations.  Representative  data  sets  from  equilibrium 
AUC  experiments  are  shown  for  the  four  erbB  fusion  constructs  at  23  mM  C8E5.  The  protein  distribution  was  monitored 
by  measuring  absorbance  at  280  nm.  Raw  data  are  shown  as  open  circles;  the  continuous  lines  represent  global  fits  for 
single  ideal  species  with  the  molecular  mass  of  a  monomer.  The  upper  panel  for  each  data  set  shows  the  residuals  for  the 
fit.  The  residuals  for  the  single  ideal  fit  are  small  and  random  for  SNerbBl,  SNerbB2,  and  SNerbB3,  indicating  those  data 
are  well  described  by  a  monomer  model.  The  residuals  for  SNerbB4,  however,  are  larger  and  non-random. 
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A  stochastic  distribution  model  predicts  some  non¬ 
specific  interactions  at  these  high  protein:detergent 
ratios  used  in  the  11  mM  C8E5  experiments,42  but 
we  sought  to  determine  if  we  could  drive  the 
system  to  a  preferential  oligomer. 

For  all  SNerbB  constructs,  the  data  at  11  mM  were 
no  longer  described  by  a  single  monomeric  species 
model  and  showed  evidence  for  protein  self¬ 
association.  However,  from  these  data  alone,  it 
was  difficult  to  fit  a  specific  association  model  (see 
discussion  below),  so  we  used  the  value  for  the 
reduced  molecular  mass,  sigma  (a),  as  a  model- 
independent  parameter  for  expressing  the  extent  of 
association,  where  ct  is  defined  by  Yphantis  as:43 


o  = 


M(1  —  up)w2 
RT 


(1) 


where  M  is  molecular  mass,  0  is  the  partial  specific 
volume  of  the  protein  (mlg-1),  p  is  the  solvent 
density  (g  mF1),  w  is  the  rotor  speed  (rad  s_1),  R  is 
the  universal  gas  constant,  and  T  is  the  temperature 
(K).  For  a  single  ideal  species  fit,  the  value  of  a 
represents  an  approximate  measure  of  the  weight- 
average  molecular  mass  of  the  species  in  solution. 
The  expected  value  of  c  for  a  given  protein  can  be 
calculated  from  the  amino  acid  sequence  and  the 
buffer  components.  If  a  protein  is  monomeric,  then 
the  experimental  value  for  ct  from  a  single  ideal 
species  model,  aexp,  would  be  equal  to  the 
calculated  value,  aca]c.  Thus,  a  ratio  of  aexp/acaIc 
greater  than  1  indicates  the  presence  of  higher-order 
species,  and  larger  ratios  can  indicate  a  greater 
extent  of  self-association.  For  SNerbBl,  SNerbB2, 
and  SNerbB3,  the  o  ratio  is  within  error  of  unity  at 
23  mM  C8E5,  whereas  the  SNerbB4  ratio  is  signifi¬ 
cantly  larger  than  1  under  the  same  conditions 
(Table  2).  Even  so,  the  experimental  ct  for  SNerbB4 
in  23  mM  C8E5  is  only  16%  larger  than  the  value 
calculated  for  monomer,  indicating  that  monomer  is 
still  a  significantly  populated  species.  In  contrast  to 
the  23  mM  data.  Table  2  shows  that  the  ct  ratios  for 
all  four  proteins  are  significantly  greater  than  1 
when  experiments  are  carried  out  in  11  mM  C8E5 
This  result  reflects  the  presence  of  higher-order 
species  in  all  four  proteins.  The  fact  that  the  data  at 
11  mM  C8E5  were  no  longer  well  described  by  a 
monomeric  species  alone  is  also  reflected  in  the 
increase  in  the  values  of  the  square-root  of  the 


variance  (SRV)  for  the  single  species  fits  as 
compared  to  the  SRVs  at  23  mM  C8E5  (Table  2). 

The  absence  of  strong  interactions  was  surpris¬ 
ing,  because  the  erbB  TM  domains  show  positive 
signals  in  the  genetic  assay  TOXCAT,  a  result  that  is 
generally  interpreted  as  an  indication  of  strong 
helix-helix  interactions.23,24  In  the  TOXCAT  assay, 
the  activity  of  the  erbB  TMs  was  approximately 
half  the  activity  of  wild-type  glycophorin  A.  More¬ 
over,  the  erbB  signals  were  several-fold  greater  than 
the  disruptive  glycophorin  A  mutant  G83I.  In 
contrast,  in  our  experiments,  the  SNerbB  proteins 
were  monomeric  under  conditions  where  even  G83I 
has  a  significant  population  of  dimer. 

Although  the  absolute  strengths  of  the  SNerbB 
interactions  were  much  weaker  than  expected  from 
the  TOXCAT  results  by  comparison  to  GpA,  a 
consideration  of  just  the  relative  signals  of  the  erbBs 
suggests  that  the  association  trends  in  bacterial 
membranes  as  measured  by  TOXCAT  are  mirrored 
in  the  extent  of  association  observed  in  the 
centrifuge  in  11  mM  C8E5.  This  comparison  is 
shown  in  Figure  3,  where  it  can  be  seen  that  in 
each  assay  the  TM  sequences  of  erbBl,  erbB2,  and 
erbB3  exhibit  about  the  same  levels  of  association, 
whereas  there  is  evidence  for  a  somewhat  greater 
extent  of  association  for  erbB4.  Although  SNerbB4 
exhibits  a  somewhat  greater  propensity  for  inter¬ 
action  than  the  other  three  sequences,  the  inter¬ 
actions  of  all  four  proteins  in  micellar  solutions  are 
very  weak,  especially  by  comparison  to  the  GpA 
sequences.39 

Specific  association  models  for  SNerbBs  are 
difficult  to  define 

We  fit  the  sedimentation  equilibrium  data  with  a 
number  of  self-association  models  to  determine  the 
nature  of  species  present  in  11  mM  C8E5.  We  tried  a 
number  of  monomer-Nmer  models  and  found  that 
for  all  four  proteins  a  monomer-trimer  model  was 
the  best  description  of  the  data  from  these  simple 
two-species  models.  Table  3  shows  the  results  of 
these  fits  for  one  of  the  experiments  with  SNerbB4, 
and  results  were  similar  for  the  other  three  proteins. 
The  fact  that  a  monomer-trimer  equilibrium  was 
the  best  fit  to  the  data  from  the  simple  self¬ 
association  models  was  surprising,  since  the 


Table  2.  Analysis  of  SNerbB  AUC  data  using  single  ideal  fits 


23  mM  C8E5  11  mM  C8E5 


SNerbB 

^calc 

SRV  (10  ~3) 

^exp 

^calc/ ^exp 

SRV  (10“3) 

^exp 

^calc/^exp 

1 

0.9575 

5.55 

1.0036 

1.0481 

7.95 

1.3657 

1.4263 

2 

0.9287 

6.71 

0.9844 

8.18 

1.2943 

1.3937 

3 

0.9841 

5.79 

1.0581 

1.0751 

7.39 

1.3593 

1.3813 

4 

0.9533 

7.79 

1.1088 

1.16311 

9.06 

1.6260 

1.7057 

At  detergent  concentrations  of  23  mM  or  greater,  the  SNerbB  proteins  are  monomeric  in  solution.  When  the  detergent  concentration  is 
reduced  to  11  mM,  evidence  of  higher-order  species  can  be  seen.  The  values  of  o  for  the  four  SNerbB  fusion  constructs  were  calculated 
using  SEDNTERP  for  20,000  rpm.  aL.xp  is  the  value  returned  by  the  NONLIN4'  fitting  procedure  for  a  single  ideal  species  at  20,000  rpm. 
SRV  is  the  square-root  of  the  variance  for  these  fits.  The  experimental  numbers  in  the  Table  represent  the  average  of  at  least  two 
independent  measurements. 
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Figure  3.  A  comparison  of  the  relative  CAT  activities  of 
the  erbB  transmembrane  sequences  to  the  a  ratios  from 
11  mM  CgEs  determined  by  sedimentation  equilibrium 
using  a  single  ideal  species  fit.  It  can  be  seen  that  in  both 
assays,  the  transmembrane  segment  of  erbB4  appears  to 
be  the  most  associated  of  the  four  transmembrane 
domains. 
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biology  of  the  erbB  receptors  suggests  that  a 
monomer-dimer  equilibrium  would  likely  be  the 
most  appropriate  model  to  describe  the  data.44,45 
Nevertheless,  Figure  4  shows  a  comparison  of  the 
species  plots  for  these  fits,  where  it  can  easily  be 
seen  from  the  non-random  residuals  that  a  mono¬ 
mer-dimer  equilibrium  is  not  as  good  a  description 
of  the  data  as  a  monomer-trimer  model. 

Therefore,  to  independently  identify  the  oligo¬ 
mers  present  in  solution  and  to  better  discriminate 
between  models  for  the  sedimentation  equilibrium 
data,  we  carried  out  glutaraldehyde  cross-linking 
experiments.  Cross-linking  techniques  can  capture 
the  oligomers  that  are  preferentially  populated  in 
solution.  The  fusion  proteins  were  cross-linked  in 
11  mM  C8E5  at  the  highest  protein  concentrations 
used  in  the  centrifugation  experiments.  Figure  5 
shows  that  bands  for  both  dimers  and  trimers  could 
be  seen,  as  well  as  traces  of  higher-order  aggregates. 
At  extended  times,  the  SNerbB  proteins  all  cross¬ 
link  as  large  aggregates.  The  cross-linking  results 
did  confirm  the  presence  of  a  dimer  species,  but 
they  also  revealed  that  it  is  likely  at  least  three 
species  are  present  in  solution,  which  may  be  why  a 
unique  fit  using  the  AUC  data  alone  was  difficult  to 
define. 

The  cross-linking  results  and  the  biology  of  the 


Radius,  cm 


Table  3:  Various  association  models  fit  to  data  for 
SNerbB4  in  IlmmCsEs 
Model  SRV 


Single  Ideal 

8.56e-3 

Monomer-Dimer 

7.31  e-3 

Monomer-T rimer 

6.49e-3 

Monomer-Tetramer 

7.92e-3 

Monomer-Dimer-Trimer 

6.42e-3 

Monomer-DImer-Tetramer 

6.45e-3 

Figure  4.  Species  plots  for  various  association  models  fit  to  data  for  SNerbB4  in  11  mM  C8E5.  Although  only  one  data 
set  is  shown,  a  global  fitting  procedure  using  nine  data  sets,  three  concentrations  at  three  rotor  speeds  was  employed.  It 
was  expected  that  the  equilibrium  for  SNerbB4  would  be  described  by  a  monomer-dimer  equilibrium,  but  higher  order 
models  appear  to  fit  equally  as  well  if  not  better,  thus  making  it  difficult  to  distinguish  which  is  the  appropriate 
description  of  the  system. 
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Figure  5.  Higher-order  SNerbB  oligomers  can  be 
visualized  by  SDS-PAGE  of  glutaraldehyde  cross-linked 
proteins.  The  proteins  were  cross-linked  in  11  mM  C8E5  at 
43  pM  with  0.1%  glutaraldehyde  and  run  on  a  12.5% 
acrylamide  Phast  gel.  Lane  1,  molecular  mass  markers; 
lanes  2-6:  0,  five,  ten,  30, 90,  and  120  seconds,  respectively. 
Bands  can  be  seen  for  both  dimers  and  trimers;  traces  of 
higher-order  aggregates  can  also  be  seen.  At  long  times, 
all  the  proteins  cross-link  as  large  aggregates  that  crash 
out  of  solution. 


erbBs  justify  the  use  of  three  species  models  that 
include  dimers  for  the  analysis  of  the  sedimentation 
equilibrium  data.  Therefore,  we  fit  the  data  with  a 
monomer-dimer-trimer  model,  and  the  statistics  of 
the  fit  and  the  shapes  of  the  residuals  indicated  the 
model  was  a  reasonable  description  of  the  data 
(Figure  4).  Separation  of  the  trimer  terms,  which  can 
be  interpreted  as  a  non-equilibrating  population  for 
that  species,  led  to  an  improvement  in  the  fit.  The 
inclusion  of  the  additional  parameters  was  supported 
by  the  Akaike  information  criterion46  statistical  test 
and  analysis  of  the  residuals  by  a  runs  test.47 

Stochastic  association  largely  accounts  for  the 
observed  interactions 

The  observed  species  distributions  for  the 
SNerbB  proteins  in  11  mM  C8E5  were  compared  to 
what  is  predicted  by  a  purely  stochastic  distribution 
under  these  conditions.  At  these  high  protein  to 
detergent  ratios  where  the  number  of  micelles 
approaches  the  number  of  protein  molecules,  the 
probability  that  more  than  one  protein  molecule  can 
occupy  a  given  micelle  in  the  absence  of  a 
preferential  interaction  is  non-negligible,  and  the 
distribution  of  proteins  in  micelles  is  given  by  a 
Poisson  distribution  as  described  in  Materials  and 
Methods.42  Figure  6  shows  a  comparison  of  the 
experimentally  observed  species  plots  for  SNerbB4 
to  those  predicted  by  a  stochastic  model.  The 


comparison  shows  that  the  fraction  of  monomer 
for  SNerbB4  decreases  more  quickly  than  one 
would  expect  from  purely  stochastic  interactions. 
It  also  demonstrates  that  the  fraction  of  dimers 
expected  from  a  random  distribution  is  non- 
negligible  at  the  concentrations  used  in  the  sedi¬ 
mentation  equilibrium  experiments,  highlighted  by 
the  thickened  portion  of  the  line.  The  presence  of 
yet  another  distinct  species,  namely  the  stochastic 
dimer,  may  also  have  contributed  to  the  difficulty  in 
defining  a  model  for  the  AUC  data  at  11  mM  C8E5. 
Attempts  were  made  to  fit  the  data  with  models  that 
included  a  parameter  to  approximate  the  stochastic 
dimer,  but  this  did  not  improve  the  fits.  SNerbB4 
exhibits  the  greatest  propensity  of  the  erbBs  to 
associate,  so  the  curves  for  SNerbBl,  SnerbB2,  and 
SNerbB3  were  shifted  even  closer  to  the  random 
distribution  curves.  They  do  remain  to  the  left  of  the 
simulated  data,  indicating  that  they  are  slightly 
more  associated  than  one  would  expect  from  a 
purely  random  distribution.  FJowever,  we  estimate 
that  any  preferential  interactions  between  the  erbB 
TMs  are  extremely  modest,  certainly  less  than 
lkcalmol-1,  and  we  conclude  that  stochastic 
interactions  account  for  a  large  portion  of  the 
SNerbB  association  seen  in  11  mM  C8Es.  It  is 
worth  noting  though  that  at  23  mM  C8E5,  the 
stochastic  model  predicts  a  negligible  amount  of 
association  but  SNerbB4  is  not  monomeric  under 
these  conditions. 

Heterodimers  of  the  SNerbB  proteins  could  not 
be  detected 

Since  interactions  in  the  transmembrane  domains 
have  been  implicated  in  the  heterodimerization  of  the 
erbB  receptor  tyrosine  kinases,6'20  we  carried  out 
experiments  to  detect  hetero-interactions  of  the 
SNerbB  fusions  using  SDS-PAGE.  For  each  of  the 
six  pairwise  interactions,  equal  amounts  of  both 
proteins  were  combined,  but  only  monomer  was 
visible  on  the  gel  when  the  samples  were  analyzed  by 
SDS-PAGE  (data  not  shown).  We  also  attempted  to 
detect  modest  interactions  by  sedimentation  equili¬ 
brium  between  SNerbB2  and  SNerbB3.  We  chose  this 
pair  because  the  erbB2/erbB3  heterodimer  is  the  most 
potent  signaling  complex48,49  and  only  weak  hetero¬ 
interactions  can  be  detected  between  the  soluble 
ligand-binding  domains,7  so  we  hypothesized  that  if 
TM-TM  interactions  were  important  for  stabilizing 
any  of  the  hetero-complexes  it  would  be  the  erbB2/ 
erbB3  complex.  Our  results,  however,  showed  that  an 
equimolar  amount  of  SNerbB2  and  SNerbB3  at 
23  mM  C8E5  yielded  a  molecular  mass  consistent  with 
protein  monomers  in  solution  (Figure  7). 


Discussion 

ErbB  TM  interactions  are  difficult  to  detect  in 
micellar  solutions 
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Figure  6.  An  overlay  of  a  stochas¬ 
tic  distribution  and  the  distribution 
calculated  from  a  monomer-dimer- 
separate  trimer  fit  to  one  of  the 
SNerbB4  data  sets.  The  dotted  lines 
represent  the  monomer,  dimer,  and 
trimer  populations  simulated  using 
a  random  distribution  of  proteins 
into  micelles  for  an  aggregation 
number  of  60  and  a  maximum 
occupancy  of  k=5.  The  continuous 
lines  represent  the  distributions 
calculated  using  the  global  Ki2 
constant  and  the  Ki3  from  the  data 
set  for  the  middle  concentration  at 
24,500  rpm  for  11  mM  C8E5.  The 
thickened  black  portion  of  the  line 


represents  the  concentration  region  where  the  data  were  actually  collected.  It  can  be  seen  that  the  association  of  SNerbB4 


is  only  very  moderately  shifted  from  what  one  would  expect  from  a  stochastic  distribution. 


force  for  protein  association  is  not  unprecedented, 
and  several  TM  segments  have  been  identified  that 
are  capable  of  driving  oligomerization.30-32  Much 
consideration  has  been  given  as  to  whether  this  is 
the  role  of  the  erbB  transmembrane  domains.  Some 
studies  have  indicated  that  active  EGF  receptors  can 
tolerate  a  variety  of  transmembrane  sequences,26-29 
but  others  studies  have  suggested  a  specific  role  for 
the  transmembrane  domain  in  dimerization,  with 
perhaps  the  most  suggestive  evidence  for  a  role  for 
TM  helix  interaction  coming  from  the  genetic  assay 
TOXCAT.15'21'24 

Contrary  to  what  was  expected  from  the  TOX¬ 
CAT,24  we  found  that  the  transmembrane  domains 
of  the  erbB  receptors  were  not  sufficient  to  drive  the 
strong  dimerization  of  an  SN  fusion  partner  in 
detergent  micelles.  This  distinguishes  the  erbB  TM 
sequences  from  the  transmembrane  domains  of 
GpA  and  bNIP3,  which  are  capable  of  driving 
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Figure  7.  A  representative  sedimentation  equilibrium 
data  set  for  a  mixture  of  equimolar  amounts  of  SNerbB2 
and  SNerbB3  fit  to  a  single  ideal  species.  The  raw  data  are 
represented  as  open  circles  and  the  continuous  line 
represents  the  fit  and  the  upper  panel  shows  the  residuals 
for  the  fit.  The  data  are  well  described  by  a  single  ideal 
species  with  a  molecular  mass  of  21,273  kDa,  which  is 
equal  to  the  average  of  the  two  monomers,  SNerbB2 
(20,876  kDa)  and  SNerbB3  (21,669  kDa). 


robust  dimerization  of  chimeric  proteins  in  a 
number  of  environments,  including  detergent 
micelles  and  bacterial  membranes.30-32'37'39'50-  2  It 
was  also  initially  surprising  that  the  erbB  TMs  did 
not  interact,  given  that  the  sequences  contain 
GxxxG  motifs,  which  can  be  an  important  sequence 
motif  in  strong  helix-helix  interactions.25'31  How¬ 
ever,  it  has  been  subsequently  shown  that  a  GxxxG 
motif  is  not  sufficient  to  drive  TM  helix  dimeriza¬ 
tion.39,42  Our  results  are  also  consistent  with  an 
earlier  report  of  a  similar  SN/erbBl  TM  fusion 
construct  migrating  as  a  monomer  on  SDS-PAGE.53 

It  is  possible  that  the  lack  of  interaction  is  a 
consequence  of  the  SN  fusion  construct  that  was 
used,  but  we  believe  this  is  unlikely.  Our  linker 
region  is  14  amino  acid  residues  long  and  is 
comparable  to  what  has  been  used  in  the  past  to 
successfully  measure  the  energetics  of  glycophorin 
A,  a  number  of  GpA  mutants,  and  other  TM 
helices.31,33'37,39'40  In  all  of  these  studies,  the  SN 
fusion  did  not  appear  to  interfere,  and  the  linker 
length  was  sufficient  to  allow  the  TMs  to  interact. 
Additionally,  extending  the  linker  between  the  SN 
domain  and  the  TM  sequence  had  no  effect  on  the 
interaction  behavior  of  the  CCK-4  TM  domain.42 
Furthermore,  it  has  been  reported  that  a  peptide 
corresponding  to  the  TM  of  the  EGF  receptor 
(erbBl)  migrated  as  a  monomer  in  SDS-PAGE,54 
and  isolated  erbB2  TM  peptides  also  migrate 
predominantly  as  monomers.  These  results  with 
isolated  TM  peptides  are  consistent  with  the  SDS- 
PAGE  analysis  of  our  SN-fusion  constructs.  Some 
reports  do  indicate,  however,  peptides  correspond¬ 
ing  to  the  TM  of  the  proto-oncogenic  neu  receptor 
from  rats,  a  homolog  of  erbB2,  migrate  as  some¬ 
thing  larger  than  monomers  on  SDS-PAGE.36'55'56 

While  specific  protein-protein  interactions  deter¬ 
mined  by  the  amino  acid  sequence  are  important  for 
TM  helix  interactions,  the  environment  can  also  be  a 
critical  factor  in  the  investigation  of  membrane 
proteins.  Successful  solution  studies  can  depend  on 
the  choice  of  an  appropriate  detergent.  However, 
our  inability  to  detect  strong  interactions  does  not 
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seem  to  be  simply  the  effect  of  the  selected 
experimental  environment,  as  the  protein  did  not 
interact  in  a  number  of  detergents.  The  SNerbB 
proteins  are  principally  monomeric  in  the  non-ionic 
detergent  C8E5,  as  well  as  in  the  ionic  detergent 
SDS.  Preliminary  sedimentation  equilibrium 
experiments  in  the  zwitterionic  detergent  SB-C14 
also  failed  to  detect  strong  SNerbB  interactions 
(data  not  shown). 

Detergent  micelles  have  proven  to  be  powerful 
and  useful  tools  in  many  membrane  protein  studies, 
but  detergent  micelles  are  sometimes  criticized  as 
being  poor  substitutes  for  the  two-dimensional 
fluid  of  a  lipid  bilayer.  In  the  case  of  the  erbB 
proteins,  it  may  be  that  the  bilayer  imposes 
structural  constraints  that  favor  dimerization-com¬ 
petent  conformations  of  the  erbB  transmembrane 
domains  but  these  constraints  would  be  lacking  in  a 
micellar  environment.  The  results  from  the  TOX- 
CAT  assay,  which  is  carried  out  in  bacterial 
membranes,  are  certainly  consistent  with  this 
idea.24  However,  the  populations  in  solid-state 
NMR  experiments  in  model  bilayers  seem  to  be  in 
agreement  with  the  level  of  association  we  observe. 
While  the  solid-state  NMR  experiments  are  not  a 
direct  measure  of  association  and  can  be  difficult  to 
interpret,  these  studies  suggest  that  the  erbBl  TM  is 
largely  monomeric  in  bilayers,  with  the  only  signs 
of  interactions  being  seen  at  very  high  concen¬ 
trations  of  protein.5  61  The  erbB2  TM  appears  to 
exhibit  a  somewhat  greater  tendency  to  associate  in 
the  solid-state  NMR  experiments,  but  there  is  still  a 
significant  population  of  molecules  that  behave  as 
one  would  expect  for  an  unrestricted  monomer.61 
These  NMR  experiments  suggest  that  the  erbB  TM 
peptides  are  not  constitutively  associated  in 
bilayers,  and  thus  our  inability  to  detect  strong 
interactions  between  the  erbB  TM  domains  may  not 
be  the  consequence  of  the  micellar  environment 
required  for  the  sedimentation  equilibrium  studies. 
We  cannot,  however,  eliminate  the  possibility  that 
interactions  we  detect  as  extremely  weak  in  solution 
are  enhanced  by  the  environment  of  the  native 
bilayer  and  are  biologically  relevant. 

Another  possible  explanation  for  the  apparent 
discrepancy  between  TOXCAT  and  the  other  assays 
for  helix-helix  interactions  is  the  difference  in  the 
domains  flanking  the  transmembrane  segment.  In 
the  full-length  receptors,  the  transmembrane  helix 
connects  a  ligand-binding  ectodomain  and  an 
intracellular  kinase  domain.  It  may  be  that  the  TM 
segments  of  the  erbB  receptors  cannot  function  as 
independent  interaction  domains  isolated  from  the 
context  of  the  full-length  protein.  This  deficiency 
may  somehow  be  compensated  for  by  the  nature  of 
the  soluble  domains  in  the  TOXCAT  chimera.  Of  the 
studies  considered  here,  the  TOXCAT  construct  is 
the  only  construct  that  has  a  soluble  domain  with  a 
dimerization  interface,  in  the  form  of  the  ToxR 
DNA-binding  domain.  The  TOXCAT  construct  is 
also  the  only  construct  with  a  C-terminal  soluble 
domain.  It  may  be  that  the  erbB  TMs  require 
interactions  in  the  soluble  domains  to  pre-arrange 


the  TM  helices  and  that  the  TOXCAT  fusion  protein 
has  an  architecture  to  accomplish  this.  It  could  also 
be  that  the  presence  of  a  structured  C-terminal 
domain  is  important  for  stabilizing  interactions  at 
the  end  of  the  transmembrane  segment  necessary 
for  the  receptor  association.  Previously,  it  has  been 
suggested  that  the  erbB  transmembrane  helix  and 
the  intracellular  kinase  domain  are  rotationally 
coupled,  implying  some  degree  of  rigidity  in  the 
linker  between  the  two.28  It  is  also  worth  noting  that 
this  C-terminal  juxtamembrane  region  of  EGFR 
contains  an  important  phosphorylation  site  for 
protein  kinase  C62'63  and  that  the  addition  of  peptides 
corresponding  to  this  juxtamembrane  region  leads  to 
increased  receptor  autophosphorylation.64 

Consequences  of  including  a  TM  lacking  strong 
specific  self-associations 

In  studies  of  SNGpA  fusion  proteins,  it  has  been 
shown  that  it  is  appropriate  to  treat  the  micelles  as  a 
distinct  thermodynamic  phase  and  to  consider 
the  transmembrane  segments  confined  to  this 
phase.41 ,65/66  Such  a  consideration  leads  to  the 
expression:41 

AGapp  =  AG?  +  RT  ln[Micellar  Det]w  (2) 

where  AGapp  is  the  observed  apparent  free  energy 
in  bulk  aqueous  concentrations,  AG?  is  the  standard 
state  mole  fraction  free  energy  of  association,  R  is 
the  universal  gas  constant,  T  is  the  temperature  (K), 
and  [Micellar  Det]w  is  the  aqueous  concentration  of 
the  detergent  in  micelles.  The  first  term  represents 
the  association  free  energy  of  the  intrinsic  chemical 
components  of  the  association  reaction,  and  the 
second  term  represents  a  statistical  energy  of 
mixing,  the  magnitude  of  which  varies  with  the 
micellar  detergent  concentration.  In  the  case  of 
GpA,  we  believe  the  contribution  of  the  soluble  SN 
domain  to  the  standard  state  free  energy  AG?  is 
negligible,  and  we  interpret  AG?  as  the  contribution 
from  interactions  between  the  transmembrane 
domains. 

However,  the  soluble  domain  of  EGFR,  when 
ligand-bound,  does  interact.  Equation  (2)  predicts 
that  extending  these  fragments  to  include  a  hydro- 
phobic  transmembrane  segment  should  enhance 
the  apparent  dimerization  affinity  of  the  ECDs  even 
in  the  absence  of  specific  TM  interactions.  The  TM 
domain  would  be  restricted  to  the  volume  of  the 
micellar  phase,  and  thus  the  apparent  free  energy  of 
association  of  the  ECDs  would  now  contain  a 
contribution  from  the  statistical  energy  term 
described  above  as  well  as  the  intrinsic  free  energy 
of  dimerization  of  the  soluble  domains,  AGint. 
Applying  the  above  formalism,  we  can  substitute 
A Gint  for  AG?  and  rearrange  equation  (2)  to  write 
the  apparent  association  equilibrium  constant  for 
an  ECD-TM  construct  in  micelles,  Kapp,  as: 

Kapp  =  [Micellar  Det]w  (3) 
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The  equilibrium  constants  in  equation  (3)  are  in 
bulk  molar  concentration  units.  From  equation  (3), 
one  would  expect  that  the  association  of  the 
receptor  would  depend  on  detergent  concentration, 
with  a  tighter  apparent  association  expected  at 
lower  detergent  concentrations,  and  it  has  in  fact 
been  shown  that  the  full-length  EGFR,  as  well  as  a 
truncated  fragment  resembling  v-erbB,  interact 
more  strongly  at  low  detergent  concentrations.67,68 

A  study  by  Tanner  &  Kyte,  which  compared  the 
ligand-induced  dimerization  of  the  ECD  of  erbBl  to 
the  dimerization  of  a  fragment  extended  to  include 
the  TM,  is  also  consistent  with  this  formalism.  This 
study  found  the  longer  fragment  did  indeed 
associate  more  readily,  although  complete  dimer¬ 
ization  was  never  observed.21  The  dimerization  of 
the  longer  fragment  was  measured  in  1%  Triton 
X-100  (15  mM),  and  equation  (3)  predicts  that 
restriction  of  the  soluble  fragment  to  the  micellar 
phase  would  lead  to  an  apparent  67-fold  enhance¬ 
ment  in  the  association.  By  comparing  the  dimer¬ 
ization  of  the  ECD  alone,  which  had  been  shown  to 
be  complete  at  40  pM  protein,69  with  the  dimeriza¬ 
tion  of  the  ECD-TM  fragment  in  Triton  micelles  that 
they  observed  (2  nM),  the  authors  estimated  an 
observed  enhancement  of  approximately  10,000- 
fold.21  However,  a  subsequent  study  reports  com¬ 
plete  ligand-induced  dimerization  of  the  extra¬ 
cellular  domain  of  erbBl  at  concentrations  as  low 
as  4  pM  protein;7  therefore,  the  enhancement  of 
ligand-induced  dimerization  of  the  ECD  by  the 
addition  of  the  TM  may  be  less  than  originally 
suggested  and  be  more  of  the  order  of  1000- fold. 
Equation  (3)  suggests  a  large  portion  of  this  is 
simply  the  result  of  restricting  the  longer  fragment 
to  be  stably  inserted  in  the  micelles.  If  approxi¬ 
mately  65-fold  arises  from  limiting  the  protein  to 
the  micellar  phase,  only  a  15-30-fold  enhancement 
arises  from  other  features  of  the  ECD-TM  construct. 
This  additional  enhancement  is  modest,  and  it 
could  arise  from  stabilization  in  the  longer  construct 
of  structures  in  the  juxtamembrane  region  import¬ 
ant  for  dimerization  or  from  restriction  of  the 
relative  orientation  of  two  ECDs  on  the  surface  of 
a  micelle.  The  role  of  the  transmembrane  domain  in 
the  differences  observed  in  the  Tanner  &  Kyte  study 
is  still  unclear,  but  our  results  suggest  that  it  may 
not  be  necessary  to  invoke  strong  specific  inter¬ 
actions  between  the  TM  domains  to  account  for  the 
observed  enhancement. 

A  sterically  permissible  hydrophobic 
transmembrane  segment  may  be  sufficient 

Some  studies  have  indicated  active  receptors 
tolerate  a  good  deal  of  sequence  promiscuity  in 
the  transmembrane  segment,26,29  but  others  have 
implicated  sequence  specificity  in  the  TM  is 
necessary  to  maintain  proper  receptor  interactions 
in  the  EGFR  family.17,18,2*  If  there  are  not  strong 
interactions  between  the  transmembrane  domains, 
then  why  do  some  TM  sequence  variants  affect 
receptor  function?  This  dual  behavior  might  be 


reconciled  in  a  case  where  proper  orientation  of  the 
soluble  domains  puts  the  transmembrane  domains 
in  close  proximity  without  the  transmembrane 
segments  necessarily  providing  significant  energy 
to  stabilize  the  dimer.  In  other  words,  it  may  only  be 
required  that  the  transmembrane  region  dimers  be 
sterically  permissible.  Examination  of  the  crystal 
structures  of  the  extended  ligand-bound  confor¬ 
mation  of  the  erbB  extracellular  domains  and 
structure-based  models  of  the  receptor  dimers 
reveal  that  the  C  termini  of  domain  IV  point  toward 
one  another  in  way  that  could  juxtapose  the 
transmembrane  domains  of  two  receptors.70^73 
The  need  for  a  close-approach  of  the  trans¬ 
membrane  domains  might  explain  the  occurrence 
of  motifs  that  place  small  residues  on  the  same  face 
of  the  helix,  such  as  the  Stenberg  &  Gullick  and  GG4 
motifs. 

The  receptors  could  be  destabilized  by  mutations 
in  the  TM  that  introduce  large  steric  clashes  in 
regions  of  the  transmembrane  segments  positioned 
close  in  space  by  the  orientation  of  the  soluble 
domains.  For  mutations  to  have  this  effect,  there 
would  have  to  be  some  degree  of  rigidity  in  the 
structure  so  that  the  soluble  domains  and  the 
mutated  transmembrane  sequence  could  not  simul¬ 
taneously  adopt  their  optimal  conformations. 
A  previous  study  that  explored  the  rotational 
coupling  between  the  transmembrane  segment 
and  the  kinase  domain  of  the  neu  receptor  has  in 
fact  suggested  that  the  linker  between  the  two 
domains  is  rigid.28  Conversely,  sequence  variants 
that  could  be  juxtaposed  when  the  soluble  domains 
of  the  protein  adopt  their  preferred  conformation  in 
the  dimer  would  be  tolerated. 

Such  a  situation  would  be  consistent  with  what  is 
observed  for  the  neu  mutation.  The  wild- type 
valine  residues  may  be  in  close  proximity  in  the 
transmembrane  region,  but  not  interacting  to  any 
significant  extent.  Placing  a  glutamic  acid  residue  in 
an  already  optimal  position  introduces  a  hydrogen¬ 
bonding  interaction,  in  an  otherwise  non-interacting 
domain,  which  stabilizes  the  dimer.  The  subsequent 
introduction  of  an  unfavorable  interaction,  such  as 
one  might  expect  if  glycine  residues  in  close 
proximity  are  mutated  to  larger  residues,  would 
then  hinder  the  receptor  association.17  When  placed 
in  a  non-optimal  position  elsewhere  in  the  trans¬ 
membrane  domain,  no  effect  is  observed.18  It  may 
be  the  case  that  the  sterics  of  the  soluble  domains 
prevent  the  glutamic  acid  residues  in  the  trans¬ 
membrane  domains  from  interacting.  Alternatively, 
it  may  be  an  effect  of  the  sequence  context  of  the  TM 
itself/4  Since  a  number  of  studies  have  shown  that 
the  introduction  of  a  single  appropriately  placed 
polar  residue  in  a  TM  can  lead  to  association  of  an 
otherwise  non-interacting  TM  domain,  '  '  we  do 
not  believe  that  it  is  a  prerequisite  for  the  wild-type 
erbB  transmembrane  domains  to  have  significant 
interactions  in  order  for  the  neu  mutation  to  effect 
cellular  transformation  by  stabilization  of  receptor 
dimers. 
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Conclusions 

Contrary  to  what  was  expected,  given  the 
behavior  of  the  erbB  transmembrane  segments  in 
TOXCAT,  we  do  not  detect  strong  specific  inter¬ 
actions  between  the  TM  segments  of  the  erbB 
proteins  in  detergent  micelles  using  an  SN-TM 
fusion  system.  This  is  in  contrast  to  the  transmem¬ 
brane  domains  of  GpA  and  bNIP3,  which  give 
strong  TOXCAT  signals  and  are  capable  of  driving 
stable  protein  oligomerization  in  a  number  of 
environments.  Our  study  suggests  that  one 
interpretation  of  the  results  is  the  transmembrane 
segments  of  the  erbB  proteins  isolated  from  the  full- 
length  protein  are  not  independent  association 
domains  and  cannot  drive  oligomerization,  at 
least  in  a  micellar  environment.  It  may  be  that 
interactions  we  detect  in  solution  as  extremely 
modest  are  significantly  enhanced  in  a  bilayer 
environment.  However,  given  that  the  biology  of 
the  receptor  dictates  that  protein  exists  in  a 
monomer-dimer  equilibrium,  it  may  not  be  surpris¬ 
ing  that  TM  interactions  are  distinctly  weaker  than 
for  those  of  glycophorin  A.  Even  a  non-interacting 
hydrophobic  transmembrane  domain,  however,  can 
increase  the  apparent  affinity  of  the  covalently 
attached  soluble  domain  by  restricting  the  protein 
to  the  hydrophobic  phase  of  the  solution.  The  role  of 
the  transmembrane  domain  in  the  context  of  the 
full-length  protein  remains  unclear,  but  it  may  be 
sufficient  for  the  transmembrane  sequences  of  the 
erbB  proteins  to  be  sterically  permissible.  In  this 
scenario,  the  transmembrane  domains  are  passive,  in 
the  sense  that  they  do  not  contribute  significant 
energy;  however,  not  just  any  sequence  will  suffice 
because  the  packing  of  the  TM  segments  must  not 
interfere  with  the  association  of  the  other  domains. 
The  contribution  of  the  individual  domains  of  the 
receptors  to  the  full-length  complexes  is  unclear,  but  it 
seems  likely  the  association  of  the  erbB  receptors  will 
be  the  sum  of  many  interactions,  including  numerous 
weak  contacts,  distributed  throughout  all  the 
domains. 


Materials  and  Methods 

Cloning  and  expression 

Quikchange  mutagenesis  was  used  to  introduce  a  Smal 
site  into  the  petllb-SNGpA  plasmid  N-terminal  to  the 
GpA  TM  domain.  The  GpA  TM  sequence  was  excised 
from  the  SN  vector  containing  an  open  reading  frame  for 
a  staphylococcal  nuclease  C-terminal  GpA  fusion  by 
restriction  digest  with  either  Smal/BamHI  or  Xmal/ 
BamHI,  leaving  behind  the  reading  frame  for  staphylo¬ 
coccal  nuclease  and  a  C-terminal  linker.  The  erbB2 
transmembrane  sequence  was  generated  by  a  PCR  of  a 
pccB2  plasmid  template,  a  generous  gift  from  Dr  Mark 
Lemmon  and  Dr  Jeanine  Mendrola,  using  primers 
containing  Smal  and  Bglll  restriction  sites.  Bglll  creates 
an  overhang  compatible  with  BamHI  and  the  erbB2  TM 
sequence  contains  an  internal  BamHI  site.  The  PCR 
product  was  digested  and  ligated  into  the  SN  vector. 


The  other  erbB  sequences  were  generated  by  ordering 
synthetic  5'  phosphorylated  oligonucleotides  with  the 
appropriate  overhangs  from  Invitrogen  and  annealing  the 
oligonucleotides  by  heating  to  95  °C  with  subsequent 
cooling.  The  annealed  oligonucleotides  were  then  ligated 
into  the  SN  vector.  SnerbB2  variants  were  made  using  the 
Stratagene  Quikchange  protocol,  and  the  sequences  for  all 
constructs  were  verified  by  DNA  sequencing.  The 
SNerbB  plasmids  were  transformed  into  HMS174(DE3) 
cells,  and  expression  of  the  protein  was  induced  with 
IPTG.  The  SNerbB  proteins  were  purified  using  Thesit® 
detergent  according  to  the  protocol  published  for 
SNGpA.30'37 

SDS-PAGE  assay 

Cells  were  grown  to  an  A60q  of  0.8,  induced  with  1  mM 
IPTG  for  four  hours,  and  harvested  by  centrifugation.  Cell 
pellets  were  resuspended  in  SDS-PAGE  loading  buffer 
and  analyzed  using  a  20%  (w/ v)  acrylamide  gel  using  a 
Pharmacia  Phast  gel  system.  For  hetero-dimer  inter¬ 
actions,  equal  amounts  of  protein  at  25  pM  in  20  mM 
NaPi  7.0,  100  mM  NaCl,  and  2%  (w/v)  Thesit®  were 
mixed  and  incubated  for  30  minutes  at  37  °C.  The  samples 
were  then  analyzed  on  a  20%  acrylamide  Phast  gel. 

Sedimentation  equilibrium  analytical 
ultracentrifugation 

Purified  proteins  were  exchanged  into  the  neutrally 
buoyant  C8E5  detergent  by  ion-exchange  chromato¬ 
graphy  as  described.37  Sedimentation  equilibrium  experi¬ 
ments  were  performed  in  Beckman  XL-A  analytical 
ultracentrifuge  in  six-sector  cells  according  to  the  pre¬ 
viously  published  protocol  for  SNGpA,  with  the  excep¬ 
tion  that  the  protein  was  observed  at  280  nm.37  Data  were 
collected  at  three  initial  protein  concentrations  (A2bo=0.9, 
0.6,  0.3)  and  three  rotor  speeds  (20,000  rpm,  24,500  rpm, 
and  30,000  rpm).  WINMATCHf  was  used  to  check  that 
equilibrium  was  reached  at  each  speed.  For  the  data 
analysis,  protein  molecular  masses  and  partial  specific 
volumes  were  calculated  using  the  program  SEDNTERP.77 
Since  none  of  the  transmembrane  sequences  of  interest 
contained  tryptophan,  the  absorbance  of  the  TM  was 
considered  negligible  relative  to  the  SN  part  of  the  fusion 
and  an  extinction  coefficient  of  A28o  =  l  mg/ml  was  used. 
The  radial  distribution  profiles  for  the  nine  data  sets  were 
globally  fit  by  a  non-linear,  least-squares  curve-fitting 
procedure  in  the  Windows  version  of  NONLIN.47  The 
Akaike  information  criterion  (AIC)  was  calculated  using 
the  equation:46 

AIC  =  -N  In  ct2  -  2 P  (4) 

where  N  is  the  number  of  degrees  of  freedom,  o  is  the 
square-root  of  the  variance,  and  P  is  the  number  of 
parameters  in  the  model.  The  runs  test  score  was 
calculated  for  each  data  set  from  the  residuals  of  the 
global  fit  using  the  equations  described  earlier.78 

Glutaraldehyde  cross-linking 

Glutaraldehyde  (1%,  w/v)  in  buffer  was  added  to 
40  pM  protein  in  11  mM  C8E5,  20  mM  NaPi  7.0,  and 
200  mM  NaCl  to  a  final  concentration  of  0.1%  glutaralde¬ 
hyde.  Aliquots  were  taken  at  five,  ten,  30,  90,  and  120 
seconds  and  quenched  with  SDS-PAGE  loading  buffer. 
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Samples  were  boiled  for  five  minutes  and  analyzed  on 
12.5%  acrylamide  Phast  gels. 

Stochastic  distribution  data  simulation 

When  the  number  of  micelles  and  the  number  of 
protein  monomers  far  exceeds  the  occupancy  of  a  micelle, 
the  distribution  of  the  protein  into  the  micelles  due  to 
random  chance  can  be  described  by  a  Poisson  distri¬ 
bution: 

Lim(m,  n>>k ) :  Prob[X;  =  k]  =  ^  (— )  Vm/"  (5) 

k\  \nJ 

where  X,  is  the  random  variable  that  counts  the  number  of 
proteins  in  a  micelle  i,  k  is  the  number  of  proteins 
occupying  a  micelle,  m  is  the  number  of  protein 
molecules,  and  n  is  the  number  of  micelles.42  Given  a 
stochastic  distribution,  the  fraction  of  proteins  that  are  in 
a  particular  oligomeric  state,  fj  where  j  is  a  particular 
value  of  the  occupancy,  can  be  calculated  from  the 
following  partition  function,  which  considers  only  occu¬ 
pied  micelles: 

.  jProb[X,  =jj 
E»3tProb[X,=»J 

The  series  was  truncated  at  an  occupancy  of  k  =  5  because 
oligomers  larger  than  this  did  not  contribute  significantly 
to  the  distribution  at  micelle  to  protein  ratios  smaller  than 
1:1.  Additionally,  oligomers  larger  than  pentamer  would 
spin  out  at  the  speeds  used  in  the  sedimentation 
equilibrium  experiments  and  are  unobservable  in  the 
centrifuge.  To  calculate  the  species  distribution  as  a 
function  of  total  micellar  detergent  instead  of  the 
concentration  of  micelles,  the  ratio  m/n  was  divided  by 
the  aggregation  number.  An  aggregation  number  of  60, 
which  lies  between  the  aggregation  numbers  reported  for 
C8E4  and  C8E6,79  was  used. 
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CHAPTER  19 

Analysis  of  Membrane  Proteins 
Using  Analytical 
Ultracentrifugation 

KAREN  G.  FLEMING 

1  Introduction 

In  recent  years,  analytical  ultracentrifugation  (AUC)  has  played  an  important  role  in 
the  analysis  of  membrane  proteins.  Sedimentation  velocity  continues  to  be  an 
extremely  useful  tool  for  assessing  the  homogeneity  of  membrane  protein  prepara¬ 
tions  in  detergent  micelle  solutions;  however,  because  of  the  multiple  components 
present  in  a  membrane  protein: detergent  micelle  particle,  the  hydrodynamic  inter¬ 
pretation  of  the  sedimentation  coefficient  is  not  as  straightforward  as  it  is  with  solu¬ 
ble  proteins.  As  in  the  past,  sedimentation  equilibrium  continues  to  be  used  to 
determine  the  molecular  weights  of  many  membrane  proteins  when  dissolved  in 
detergent  micelles.  More  recently,  the  method  has  played  a  major  role  in  studies  on 
the  thermodynamics  of  subunit  interactions  between  membrane  proteins.  The  devel¬ 
opment  of  these  protocols  has  spumed  theories  describing  the  dependence  of  a 
membrane  protein  oligomeric  population  on  the  amount  of  detergent  as  well  as  novel 
studies  aimed  at  designing  membrane  proteins  with  desired  oligomerization  proper¬ 
ties.  The  bulk  of  the  contents  of  this  chapter  will  therefore  mainly  focus  on  the  issues 
related  to  measuring  and  interpreting  the  buoyant  molecular  weight  of  a  membrane 
protein  in  complex  with  detergent. 


1.1  General  Considerations  for  Sedimentation  Equilibrium 
Experiments  to  be  Carried  Out  on  Membrane  Proteins 

The  general  considerations  for  an  experiment  to  be  carried  out  on  a  membrane  pro¬ 
tein  are  similar  to  those  applicable  to  a  soluble  protein.  Since  sedimentation  equilib¬ 
rium  experiments  can  be  very  sensitive  to  small  amounts  of  contaminating  material, 
the  best  results  will  be  obtained  if  the  membrane  protein  preparation  is  >95%  pure 
as  judged  by  polyacrylamide  gel  electrophoresis.  Due  to  their  hydrophobic  nature, 
purified  integral  membrane  proteins  in  solution  must  be  studied  in  buffers  containing 
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a  detergent  micelle  or  lipid  vesicle  cosolvent,  and  the  vast  majority  of  AUC  experi¬ 
ments  are  executed  on  membrane  proteins  dissolved  in  detergent  micelles. 
Membrane  proteins  are  typically  analyzed  using  the  absorbance  optics  because  the 
interference  optics  would  also  contain  a  signal  from  the  detergent,  which  would 
introduce  uncertainty  into  monitoring  the  sedimentation  of  the  particle  of  interest. 
Therefore,  care  must  be  taken  to  avoid  buffer  components  that  have  significant 
absorbance  at  the  detection  wavelength  of  interest.  Unless  the  membrane  protein 
binds  nucleic  acids,  this  is  not  typically  a  concern  for  detection  at  280  nm  since  most 
commonly  used  buffer  components  are  transparent  at  280  nm.  However,  some  buffer¬ 
ing  components  and  reducing  agents  do  have  significant  absorbance  at  230  nm.  Their 
use  should  be  carefully  monitored  to  avoid  saturation  of  the  absorbance  optics  sys¬ 
tem  by  the  background  buffer.  Membrane  proteins  have  been  analyzed  in  the  stan¬ 
dard  2-  and  6-sector  cells  using  either  quartz  or  sapphire  windows.  Sapphire  windows 
are  not  always  transparent  at  230  nm,  and  the  user  should  empirically  determine  the 
transparency  for  each  set  of  windows. 

The  absorbance  properties  of  the  detergent  micelles  must  also  be  considered. 
Most  detergents  are  transparent  at  280  nm;  however  Triton  X-100  is  a  notable  excep¬ 
tion,  and  this  detergent  is  not  usually  the  optimal  choice  for  this  reason,  although 
there  have  been  studies  carried  out  in  low  concentrations  of  Triton  X-100.1  Reduced 
Triton  X-100  has  been  used  more  frequently  because  it  has  greater  transparency  at 
280  nm.  Neither  of  these  detergents  are  a  problem  for  membrane  proteins  contain¬ 
ing  chromophores,  like  heme,  when  the  absorbance  of  the  sedimenting  particle  can 
be  monitored  at  a  longer  wavelength.  Depending  on  the  chemical  structure,  some 
detergents  also  have  a  modest  absorbance  at  230  nm.  If  the  detergent  concentration 
can  be  maintained  such  that  this  contribution  is  small  (typically  C0.4OD),  then  their 
use  does  not  usually  pose  a  significant  experimental  problem. 

If  a  density-matching  strategy  (described  below)  is  to  be  used,  the  detergent 
micelle  environment  should  also  be  as  homogeneous  as  possible.  A  mixture  of  deter¬ 
gent  micelle  types  with  different  partial  specific  volumes  could  compromise  the 
integrity  of  the  density  matching  method,  since  a  membrane  protein  may  preferen¬ 
tially  bind  one  type  of  detergent  over  the  other.  To  obtain  a  homogeneous  detergent 
environment,  the  membrane  protein  may  be  purified  using  that  detergent. 
Alternatively,  ion  exchange  chromatography  can  be  used  to  exchange  one  detergent 
for  another.2-5  In  some  cases,  if  the  critical  micelle  concentration  is  high  enough,  the 
detergent  can  be  brought  to  equilibrium  with  the  solution  by  dialysis. 

1.2  Choice  of  Detergent  Micelle  Environment 

The  chemical  diversity  of  detergent  micelles  types  is  large,  and  not  surprisingly  the 
effects  of  any  particular  detergent  or  lipidic  environment  upon  any  particular  protein 
can  vary  widely.6  Both  the  type  and  concentration  can  have  profound  effects  on  the 
integrity  of  a  membrane  protein  and  on  the  stability  of  any  oligomers  that  the  protein 
may  form.5-7  In  addition  to  considerations  of  the  effects  of  the  detergent  on  the  mem¬ 
brane  protein  functional  properties,  there  may  be  experimental  constraints  on  the 
choice  of  detergent  micelles.  These  will  depend  on  the  type  of  analysis  to  be  carried 
out.  For  a  simple  molecular  weight  determination  of  a  membrane  protein  population 
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that  is  expected  to  be  homogeneous,  the  available  detergent  choices  are  large  since 
many  of  the  strategies  described  below  can  be  optimized  for  this  purpose.  However, 
in  recent  years,  much  progress  has  been  made  using  sedimentation  equilibrium  to 
determine  the  thermodynamics  of  interactions  between  membrane  proteins  in  deter¬ 
gent  micelle  environments.  For  these  thermodynamic  experiments,  the  detergent 
choices  are  limited  to  those  that  can  be  used  with  the  density-matching  approach. 
This  is  because  the  monomeric  and  oligomeric  forms  of  the  membrane  protein  may 
bind  different  amounts  of  detergent,  in  which  case  the  density  increment  would  be 
expected  to  be  concentration-dependent. 


2  Expressions  for  the  Buoyant  Molecular  Weight 

2.1  The  Buoyant  Molecular  Weight  Expressed  in  Terms  of  the 
Protein  Moiety  Alone 

In  the  case  of  a  membrane  protein,  the  sedimenting  particle  will  contain  contribu¬ 
tions  from  the  sedimentation  of  the  protein  as  well  as  from  any  bound  detergent  or 
lipid  molecules.  As  is  the  case  for  soluble  proteins,  the  sedimenting  particle  also  con¬ 
tains  a  contribution  from  bound  water,  although  it  is  negligible  under  most  experi¬ 
mental  conditions.  By  consideration  of  the  thermodynamics  of  multicomponent 
systems  it  has  been  shown  that  the  buoyant  molecular  weight  of  a  membrane  protein 
in  complex  with  its  hydrophobic  solvent  is  measured  in  a  sedimentation  equilibrium 
experiment  and  equals8 


MP(1  -  f  p)  = 


2RT I  dine  \ 
c 0 2  (  dr 2  ) 


(1) 


where  the  buoyant  molecular  weight  equals  A/P(l — <p'p).  In  the  above  equation  Mp  is 
the  molecular  weight  of  only  the  protein  portion  of  the  sedimenting  particle,  which 
excludes  the  molecular  weight  of  bound  detergent,  lipid  and  water,  <jf  the  effective 
partial  specific  volume  of  the  protein  moiety  in  the  sedimenting  particle  and  takes 
into  account  the  contributions  of  bound  detergent,  lipid  and  water;  p  the  solvent  den¬ 
sity,  R  the  gas  constant,  T the  absolute  temperature,  ©the  angular  velocity  (rad  s-1), 
c  the  concentration  of  the  sedimenting  particle,  and  r  the  radial  distance  from  the 
center  of  rotation. 

In  a  thermodynamically  ideal  solution,  it  has  been  shown  that  the  buoyant  molec¬ 
ular  weight  can  be  expressed  in  terms  of  its  components9,10 

MP(  1  -  =  MP(1  -  VpP)  +  In,M,(l  -  v,p)  (2) 

where  Equation  (2)  is  the  general  equation,  and  Mt  and  v,  are  the  molecular  weight 
and  partial  specific  volumes  (mL  g_1)  of  the  rth  component,  and  nl  is  the  number  of 
molecules  of  any  rth  component  bound  to  the  protein.  Since  most  analysis  of  mem¬ 
brane  proteins  is  carried  out  on  purified  proteins  dispersed  in  a  detergent  micelle 
solution,  we  can  explicitly  write  the  buoyant  molecular  weight  as 

MP(1  —  p  p)  =  Afpfl  —  vPp)  +  nDeIMDel(l  —  v Det  p)  +  nHpMH  0(  1  —  vHj0p)  (3) 
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where  the  subscripts  Det  and  H20  indicate  the  contributions  from  bound  detergent 
and  water  molecules,  respectively.  This  equation  is  also  written  in  the  form 

MP(1  —  (j)'  p)  =  Mp[(l  —  vPp)  +  <5De[(l  —  vDetp)  +  5Hp(l  —  vHj0p)]  (4) 

where  5f  represents  the  amount  of  the  ith  component  bound  in  grams  per  gram  of  pro¬ 
tein.  In  Equations  (3)  and  (4)  it  is  assumed  that  the  contribution  from  lipid  is  negligi¬ 
ble,  however  in  some  cases,  tightly  bound  lipid  molecules  will  co-purify  with  a 
membrane  protein.  If  the  number  of  lipids  is  large,  then  an  additional  term  account¬ 
ing  for  the  bound  lipid  must  also  be  included  in  Equations  (3)  and  (4);  however  small 
amounts  of  lipid  can  usually  be  ignored  because  the  partial  specific  volumes  for  lipids 
are  generally  near  unity.10  For  clarity  in  this  initial  discussion,  we  will  assume  that 
negligible  amounts  of  lipid  are  present,  and  examples  where  this  is  not  the  case  will 
be  discussed  later.  In  addition,  it  is  important  to  note  that  the  term  in  Equations 
(3)  and  (4)  represents  the  partial  specific  volume  of  the  detergent  when  it  is  bound  to 
the  protein,  which  is  one  of  the  fundamental  assumptions  in  this  analysis.  Tanford  and 
Reynolds10  has  argued  that  the  partial  specific  volume  of  detergents  above  the  critical 
micelle  concentration  should  be  used  because  this  represents  the  vDet  when  detergent 
is  in  a  self-associated  state,  as  it  would  be  when  associated  with  a  membrane  protein. 

From  Equations  (3)  and  (4),  it  can  be  seen  that  the  experimentally  measured  buoy¬ 
ant  molecular  weight  contains  information  about  both  the  membrane  protein  molec¬ 
ular  weight  and  the  molecular  weight  of  any  bound  detergent.  Depending  on  the 
scientific  question,  the  experimental  conditions  can  be  optimized  for  determining  the 
value  of  one  or  both  of  these  parameters. 


2.2  The  Buoyant  Molecular  Weight  Expressed  in  Terms  of  the 
Protein-Detergent  Complex 

An  alternative  theoretical  expression  for  the  buoyant  molecular  weight  of  the  sedi¬ 
menting  particle  was  first  used  by  Hersh  and  Schachman9  in  1958  and  assumes  that 
the  partial  specific  volume  of  the  complex  is  approximated  by  the  sum  of  the  vol¬ 
umes  of  the  protein  and  detergent.  This  theoretical  formalism  expresses  the  quantity 
measured  in  a  sedimentation  experiment  more  directly  in  terms  of  the  protein-deter¬ 
gent  particle: 

2RT  (  din  c  \ 

«c(i  -  vcp)  =  r J  (5) 


where  here  the  buoyant  molecular  weight  is  Mc(l  -  vcp),  with  Mc  being  the  molec¬ 
ular  weight  of  the  entire  protein  detergent  particle  which  equals  MP(  1  +  5Det)  and  vc 
equals  the  partial  specific  volume  of  the  entire  complex  and  can  be  written  as 


vP  +  ‘SteVDe, 


Vc  1  +  5, 


Det 


(6) 


Additional  terms  can  be  added  if  a  contribution  from  lipid  needs  to  be  taken  into 
account.  It  is  important  to  note  that  the  buoyant  molecular  weights  in  Equations  (1) 
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and  (5)  must  be  equal,  which  emphasizes  the  point  that  0'=/=  vc  because  MPj=Mc. 
Expression  (5)  is  included  here  because  there  is  sometimes  confusion  in  the  litera¬ 
ture  about  the  definition  of  the  buoyant  molecular  weight  for  a  membrane  protein  in 
detergent  solutions.  In  practice,  the  theoretical  formalism  in  Equation  (5)  is  not  used 
as  widely  and  will  not  be  discussed  extensively  in  this  chapter.  Tanford  and 
Reynolds10  has  pointed  out  that,  using  the  expression  in  Equation  (1),  the  quantity 
MP  may  be  determined  with  greater  accuracy  than  the  quantity  Mc  from  Equation  (5) 
because  vc  is  not  easily  measured.  In  contrast,  as  will  be  discussed  below,  the  quan¬ 
tity  (1  can  be  directly  determined  by  density  measurements. 

3  Sedimentation  Equilibrium  Methods  to  Determine 
Protein  Molecular  Weights  in  Protein-Detergent 
Complexes 

3.1  Determination  of  Membrane  Protein  Molecular  Weights  by 
Classical  Density  Matching 

The  molecular  weight  of  only  the  protein  portion  of  the  buoyant  molecular  complex 
can  be  determined  by  adjusting  the  experimental  conditions  such  that  the  net  contri¬ 
bution  of  the  detergent  becomes  negligible.  This  method  is  cartooned  in  Figure  1  and 
has  been  referred  to  as  “density  matching”  or  the  condition  of  rendering  the  detergent 
gravitational  transparent."  Density  matching  can  be  accomplished  by  modifying  the 
solvent  density  such  that  it  becomes  equal  to  the  effective  density  of  the  bound  deter¬ 
gent,  e.g.  p  =  l/vDet.  When  this  is  the  case,  the  term  (l-vq^p)  approaches  zero,  and 
the  contribution  from  the  detergent  term  in  Equation  (3),  n^M^il  —  v^p)  becomes 
negligible  regardless  of  the  number  of  bound  detergent  molecules. 

Thermodynamically,  the  most  favorable  experimental  condition  can  be  obtained 
when  the  density  of  the  solvent  can  be  modulated  using  a  mixture  of  light  and  heavy 
water.  This  experimental  condition  produces  the  most  minimal  perturbation  to  the 
contribution  of  water  in  the  buoyant  molecular  weight  of  the  sedimenting  species. 
When  the  density  matching  of  detergent  is  carried  out  using  mixtures  of  water,  the 
average  effective  density  of  any  bound  water  is  assumed  to  be  equal  to  the  density 
of  the  solvent.  In  this  case,  the  water  contribution  term  from  Equation  (3), 
Vh2oMh2o(1  — Vh2qP),  will  also  be  essentially  equal  to  zero.  The  result  of  such  a 
density  matching  strategy  is  that  Equation  (3)  can  be  reduced  to 

Mp(l  -  fp)  =  MP(1  -  vP p)  (7) 

Assuming  that  the  partial  specific  volume  of  the  protein  can  be  calculated  from  the 
amino  acid  composition  using  standard  procedures,12  the  buoyant  molecular  weight 
of  the  sedimenting  complex  can  be  directly  interpreted  in  terms  of  the  molecular 
weight  of  the  protein  portion  alone. 

To  match  the  effective  density  of  the  detergent  micelles,  the  most  commonly  used 
water  mixture  is  H20/D20.  Experiments  at  higher  solvent  densities  can  be  carried 
out  using  D2180,  although  this  has  been  used  less  extensively  due  to  the  expense  of 
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Figure  1  Cartoon  of  density  matching  strategy.  Away  from  the  density  match  point,  both  the 
protein  and  the  detergent  contribute  to  the  sedimentation  of  the  particle  at  equilib¬ 
rium.  If  the  solvent  density  is  adjusted  such  that  it  equals  the  effective  density  of  the 
bound  detergent  as  inlower  panel  of  the  figure,  the  bound  detergent  becomes  essen¬ 
tially  invisible  to  the  gravitational  field,  and  the  sedimentation  of  the  particle  is 
determined  by  the  protein  moiety  alone.  This  cartoon  represents  the  classic  density 
matching  strategy  where  the  density  is  matched  with  heavy  water,  and  so  no  distinc¬ 
tion  is  made  for  bound  water,  which  is  density -matched  in  both  cases  in  this  figure 

the  D2180  compound.13  Using  deuterium  water  mixtures,  there  is  a  small  mass  cor¬ 
rection  that  must  be  incorporated  into  the  analysis  due  to  the  exchange  of  deuterium 
for  hydrogen  on  exchangeable  protons.13,14  The  density  of  heavy  water  is  limited  to 
1.10392  and  1.21465  g  mL"1  for  100%  D20  and  100%  D2180,  respectively  (at 
20  °C).12  For  this  reason,  the  use  of  the  density-matching  strategy  places  limitations 
on  the  detergents  that  can  be  employed  in  sedimentation  equilibrium  experiments 
with  this  method.  To  match  the  effective  density,  the  partial  specific  volumes  of  the 
detergent  must  be  greater  than  0.9058  mL  g_1  to  match  with  D20  and  greater  than 
0.8232  g  mL-1  to  match  with  D2lsO.  It  is  also  important  to  keep  in  mind  that  deter¬ 
gents  and  lipids  with  partial  specific  volumes  greater  than  unity  will  have  effective 
densities  that  are  less  than  the  density  of  normal  water.  This  will  cause  a  negative 
contribution  to  the  buoyant  molecular  weight  and  be  manifested  as  a  tendency  to 
float  in  the  ultracentrifugation  experiment.15  Due  to  these  density  considerations,  the 
detergents  most  commonly  used  with  the  classic  density-matching  strategy  are  QE4, 
C8Ej,  CI2E8,  Cj4  sulfobetaine,  and  dodecylphosphocholine  (DPC). 

The  first  step  in  a  density-matching  experiment  is  to  establish  the  conditions  under 
which  the  detergent  micelles  are  gravitationally  transparent.  A  good  starting  point  is 
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the  consideration  of  the  partial  specific  volume  for  the  detergent  above  the  critical 
micelle  concentration,  and  the  partial  specific  volumes  for  many  detergents  are 
known.1046-18  When  the  partial  specific  volume  is  not  known,  it  can  be  determined 
by  at  least  two  methods.  Using  a  high-precision  densimeter,  direct  density  measure¬ 
ments  can  be  used  to  measure  the  partial  specific  volume  of  a  detergent  micelle  solu¬ 
tion  in  a  buffer  of  interest.17  Alternatively,  the  analytical  ultracentrifuge  can  be  used 
to  measure  the  buoyant  molecular  weight  of  the  detergent  micelles  alone  in  a  series 
of  solvents  whose  densities  differ.  Since  many  partial  specific  volumes  have  been 
tabulated,  this  approach  is  typically  used  to  confirm  the  percentage  of  D20  required 
for  matching  the  effective  density  of  the  detergent  micelle  in  the  context  of  the  other 
buffering  components.  At  the  density-match  point,  the  detergent  micelles  will  be  uni¬ 
formly  dispersed  within  the  cell.  At  slightly  lower  or  higher  solution  densities,  the 
distributions  are  usually  linear  with  positive  and  negative  slopes,  respectively.  For  a 
rigorous  analysis,  the  value  for  the  detergent  micelle  buoyant  molecular  weight  can 
be  determined  using  Equation  (1),  and  the  detergent  micelles  are  density  matched 
when  this  buoyant  molecular  weight  equals  zero.  It  is  optimal  to  use  interference 
optics  to  detect  the  detergent  micelle  radial  distribution  when  the  detergent  micelles 
are  transparent  at  accessible  wavelengths.  Although  the  micelles  can  be  visualized 
using  a  hydrophobic  dye  that  partitions  into  them,  we  have  found  that  even  a  small 
amount  of  dye  can  alter  the  micelle  density. 

3.2  Examples  of  Density  Matching 

Several  groups  have  used  the  density-matching  strategy  to  determine  molecular 
weights  of  membrane  proteins.  Classic  work  by  Reynolds  and  Tanford  first  demon¬ 
strated  the  usefulness  of  the  density-matching  strategy.10'1819  Since  then,  many  stud¬ 
ies  have  been  carried  out  with  increasing  frequency  in  the  past  few  years.  Several 
groups  have  taken  advantage  of  the  properties  of  the  CgE4  or  C8E5  detergents,  which 
are  neutrally  buoyant  and  require  no  D20  for  density  matching.  The  molecular 
weights  of  membrane  proteins  with  diverse  folds  including  monotopic  and  polytopic 
a-helical  membrane  proteins  as  well  as  transmembrane  /3-barrel  proteins  have  been 
analyzed  using  these  neutrally  buoyant  detergent  micelles.20-26  C12Eg  micelles  have 
also  been  used  in  the  analysis  of  membrane  proteins  by  sedimentation  equilibrium. 
In  one  study,  its  effective  density  was  matched  with  18%  D20.27  In  a  different  study 
the  relative  mass  contribution  due  to  bound  C12Eg  was  assumed  to  be  small  and  no 
D20  was  added.28  More  recently,  two  groups  have  used  C14  sulfobetaine  micelles 
matched  with  13%  D20  or  DPC  micelles  matched  with  50%  D20  to  determine  the 
protein  molecular  weights  of  both  natural  and  designed  membrane  protein 
sequences.5,29-36  The  density-matching  strategy  has  especially  found  utility  in  the 
thermodynamic  analysis  of  membrane  protein  interactions. 

3.3  When  Heavy  Water  is  not  Heavy  Enough 

The  effective  densities  of  some  of  the  most  widely  used  types  of  detergent  micelles 
exceed  that  of  D20  and  therefore  cannot  be  matched  using  heavy  water.  This  class 
includes  dodecylmaltoside  (DDM)  and  /3-D-octylglucoside  (/30G),  which  are  both 
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commonly  used  to  dissolve  membrane  proteins  for  functional  and  structural  studies.6 
When  it  is  optimal  for  the  membrane  protein  of  interest  to  be  solubilized  in  these  types 
of  detergents  or  other  detergents  with  partial  specific  volumes  £0.9  mL  g_1,  alterna¬ 
tive  strategies  to  density  matching  must  be  used  to  disentangle  the  relative  contribu¬ 
tions  of  the  protein  and  detergent  contributions  to  the  buoyant  molecular  weight.  Two 
strategies  for  the  analysis  of  membrane  proteins  with  density  agents  will  be  discussed. 

3.3.1  Extrapolation  to  the  Match  Point 

The  first  method  involves  determining  the  buoyant  molecular  weight  in  several  dif¬ 
ferent  mixtures  of  heavy  water  followed  by  extrapolation  to  the  density  at  which  the 
detergent  micelles  are  gravitationally  transparent.19  If  the  protein-detergent  complex 
is  not  altered  by  changing  the  H20/D20  (or  D2180)  ratio,  the  buoyant  molecular 
weight  of  the  sedimenting  complex  will  be  a  linear  function  of  solvent  density.  A  line 
can  be  fitted  and  at  the  match  point  for  the  effective  density  of  the  detergent,  the 
value  of  the  buoyant  molecular  weight  can  be  interpreted  in  terms  of  the  protein  por¬ 
tion  alone  using  Equation  (3)  since  the  two  right  terms  in  Equation  (3)  are  negligi¬ 
ble  at  this  match  point.  A  good  example  of  this  method  is  shown  in  Figure  2.  Suarez 
et  alP  used  this  extrapolation  approach  to  measure  the  partial  specific  volume  of 
DDM  micelles  as  well  as  the  molecular  weights  of  both  heart  and  rat  liver 
cytochrome  oxidase  dissolved  in  the  same  detergent.  As  can  be  seen  in  Figure  2,  the 
extrapolated  line  for  the  buoyant  molecular  weight  of  DDM  micelles  crosses  zero  at 
a  solvent  density  of  1.22  g  mL-1.  At  this  same  point,  which  is  indicated  by  the  dot¬ 
ted  vertical  line,  the  buoyant  molecular  weights  of  the  cytochrome  oxidase  proteins 
are  still  positive,  reflecting  the  sedimentation  of  the  protein  portion.  At  this  extrapo¬ 
lated  match  point  the  molecular  weight  of  the  protein  portion  alone  can  be  calculated 
using  Equation  (4)  by  knowing  the  protein  partial  specific  volume,  which  can  be  cal¬ 
culated  from  the  amino  acid  composition.12  37  This  extrapolation  method  has  the  dis¬ 
advantage  of  requiring  an  extrapolation  to  a  match  point  that  is  not  experimentally 
accessible,  however  it  can  be  accurate  if  sufficient  data  are  collected  to  describe  the 
line  and  if  the  extrapolation  is  not  too  long.  In  fact,  Reynolds  and  mcCaslin18  has 
even  used  this  method  to  estimate  the  molecular  weight  of  bovine  rhodopsin  in 
sodium  cholate  micelles,  whose  partial  specific  volume  is  nearly  that  of  proteins.18 

3.3.2  Determination  of  Membrane  Protein  Molecular  Weights  by 
Density  Matching  the  Hydrated  Detergent  Micelle  with 
Density  Agents 

A  second  experimental  approach  recently  employed  is  the  use  of  molecules  to  increase 
the  solution  density,  which  will  be  referred  to  as  densifier  molecules.  This  experimen¬ 
tal  approach  is  most  successfully  implemented  when  moderate  amounts  of  the  density 
agents  can  successfully  match  the  sedimentation  of  the  micelles.  Nycodenz,  sucrose, 
and  glycerol  have  all  been  employed  in  several  different  studies.11’38,39  Since  the  addi¬ 
tion  of  these  densifier  compounds  can  lead  to  preferential  binding  or  exclusion  of 
water,  the  effective  density  of  the  detergent  usually  contains  a  contribution  from  bound 
water.  For  this  reason,  this  method  is  sometimes  referred  to  as  the  density-matching  of 


LSD_AUC_Chapter  19.qxd  6/4/2005 


1:04  PM 


440 


Chapter  19 


Figure  2  An  example  of  the  method  of  extrapolation  to  the  density  match  point  using  heavy 
water.  The  buoyant  molecular  weights  for  beef  heart  cytochrome  oxidase  ( open 
circles),  rat  liver  cytochrome  oxidase  (closed  circles),  and  DDM  micelles 
(squares)  were  determined  as  a  function  of  solvent  density.  The  dotted  vertical  line 
indicates  the  X-intercept  for  the  linear  fit  to  the  sedimentation  of  DDM  micelles; 
at  which  point  the  DDM  micelles  are  density-matched.  The  increased  mass  for  the 
two  protein  samples  at  this  solvent  density  reflects  the  sedimentation  due  to  the 
protein  portion  of  the  sedimenting  complex  for  those  oxidase  proteins.  Using  the 
protein  molecular  weight  and  partial  specific  volume  for  DDM  determined  at  the 
match  point  for  DDM,  the  amount  of  detergent  bound  to  each  protein  can  be  cal¬ 
culated  from  the  buoyant  molecular  weights  at  the  other  solvent  densities  using 
Equation  (3).  Reproduced  with  permission  from  Suarez  et  al.13 


the  hydrated  detergent  micelles.  Like  the  classic  density-matching  approach  carried 
out  with  H20/D20  mixtures,  the  first  step  with  this  experimental  approach  is  to  estab¬ 
lish  the  density  matching  condition.  As  shown  in  Figure  3,  this  is  accomplished  by  car¬ 
rying  out  a  sedimentation  equilibrium  experiment  on  detergent  micelles  alone  as  a 
function  of  density,  where  the  density  is  modulated  by  the  concentration  of  the  density 
agent.  Like  the  original  method,  the  effective  density  of  the  hydrated  detergent  micelle 
is  matched  when  the  buoyant  molecular  weight  equals  zero.  It  has  been  found  that  the 
effective  density  of  the  hydrated  detergent  micelle  varies  significantly  as  a  function  of 
density  agent,  and  it  is  generally  lower  than  the  effective  density  in  H20/D20  solutions, 
demonstrating  the  effect  of  the  bound  water. 

In  principle,  the  protein  moiety  may  also  experience  preferential  hydration  or  exclu¬ 
sion  of  water,  and  this  will  be  reflected  in  the  effective  partial  specific  volume  term  for 
the  protein.  In  practice,  this  contribution  has  been  observed  to  be  small  as  long  as  mod- 
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Figure  3  An  example  of  establishing  the  density-matching  conditions  for  CnE9  micelles 
using  sucrose  or  glycerol.  The  equilibrium  sedimentation  of  0.3%  (w/v)  C12E9 
micelles  was  measured  using  interference  optics  in  the  presence  of  varying  amounts 
of  sucrose  (open  squares),  glycerol  (open  triangles)  or  DwO  (open  circle).  At  equi¬ 
librium,  the  effective  mass  of  the  hydrated  Cl2E9  micelles  was  determined  as 

^eff  ^DetH 

Reproduced  with  permission  from  Mayer  et  al.38 

erate  levels  of  the  density  agent  are  used.  Lustig  et  al.n  used  an  empirical  approach  to 
estimate  the  contribution  of  water  to  the  partial  specific  volumes  of  six  different  mem¬ 
brane  proteins  and  obtained  protein  molecular  weights  that  agreed  within  ±  15%  of  the 
known  values  in  both  DDM  and  /30G  micelles.  Lebowitz  and  co-workers  used  reduced 
Triton  X-100  micelles  density  matched  with  sucrose  to  demonstrate  the  oligomeric 
state  for  simian  immunodeficiency  viral  envelop  protein  particles.39  These  authors 
assumed  a  negligible  contribution  of  bound  water  to  the  protein  moiety  and  still  were 
able  to  demonstrate  a  trimeric  stoichiometry  for  the  viral  envelop  protein  particles. 
Mayer  et  al .38  showed  in  C12E9  and  reduced  Triton  X-100  that  the  partial  specific  vol¬ 
umes  for  the  cytochrome  c  oxidase  protein  depended  only  slightly  on  the  nature  of  the 
density  agent  and  were  similar  to  the  value  obtained  in  H20/D20  mixtures.  However, 
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this  same  group  found  that  experiments  in  DDM  micelles  were  difficult  to  interpret 
when  matched  with  either  sucrose  or  glycerol  alone  in  water.38  Even  in  a  subsequent 
study40  the  combination  of  sucrose  with  D20,  which  reduced  the  sucrose  requirement, 
still  required  some  assumptions  about  the  contribution  of  hydration  to  the  protein  moi¬ 
ety  in  the  presence  of  DDM  micelles  since  the  use  of  the  partial  specific  volume  cal¬ 
culated  from  the  composition  using  the  values  of  Cohn  and  Edsall37  returned  a  protein 
molecular  weight  differing  from  the  known  value  by  a  significant  amount.  In  sum,  if 
the  hydrated  detergent  micelles  can  be  matched  by  moderate  amounts  of  density  agent, 
this  density-matching  approach  appears  a  promising  method  for  molecular  weight 
determinations  in  a  wide  variety  of  detergent  micelle  types. 

Mayer  and  co-workers  have  pointed  out  that  many  of  these  densifier  agents  are 
known  to  stabilize  proteins,  and  this  may  offer  an  additional  advantage  due  to  the  rel¬ 
atively  long  periods  of  time  required  for  sedimentation  equilibrium  experiments.38 
On  the  other  hand,  significant  amounts  of  densifier  agent  can  increase  the  solvent 
viscosity,  which  can  increase  the  time  to  equilibrium.  Thus,  the  advantages  of  this 
approach  may  depend  on  the  molecular  properties  of  the  membrane  protein  in  a  par¬ 
ticular  detergent  micelle  environment. 

3.4  Determination  of  Membrane  Protein  Molecular  Weights  by 
the  Analysis  of  Density  Increment 

An  alternative  to  density  matching  is  a  consideration  of  the  buoyant  molecular 
weight  of  the  sedimenting  particle  directly  in  terms  of  the  density  increment.  As 
defined  by  Casassa  and  Eisenberg,8  in  a  thermodynamic  consideration  of  multicom¬ 
ponent  systems,  the  density  increment  for  the  macromolecule  at  constant  chemical 
potential  of  water  and  all  diffusible  solutes  (including  detergent)  equals 

(|r)„ '  (1~<w  <8) 

where  c2  is  the  weight  concentration  of  the  protein  alone,  and  the  change  in  the  solu¬ 
tion  density  as  a  function  of  changing  the  protein  concentration  gives  the  density  incre¬ 
ment.  This  term  (1  -<p'p0)  is  identical  to  the  buoyancy  factor  given  in  Equation  (1) 
where  <f>'  is  defined  as  before.  When  multiplied  by  the  molecular  weight  of  the  protein 
moiety  alone  as  in  Equation  (1),  this  quantity  returns  the  buoyant  molecular  weight  of 
the  entire  sedimenting  complex.  The  advantage  of  analysis  using  the  density  increment 
method  is  that  the  quantity  (dp/dC2)fi  can  be  directly  and  independently  measured  on  a 
protein  solution  at  dialysis  equilibrium  with  all  other  components  using  a  high-preci¬ 
sion  densimeter.  As  little  as  two  density  measurements  on  solutions  in  the  presence  and 
absence  of  protein  has  been  used  to  calculate  the  subsequent  slope,  which  is  applied  to 
the  sedimentation  results  to  calculate  the  accurate  of  the  molecular  weights  membrane 
protein.  As  long  as  accurate  and  precise  density  measurements  can  be  made,  this 
method  greatly  expands  the  repertoire  of  detergent  micelles  that  can  be  used  in  a  sed¬ 
imentation  equilibrium  experiment  since  no  density-matching  considerations  are  nec¬ 
essary.  In  addition,  this  approach  works  well  for  membrane  proteins  dispersed  in 
mixtures  of  different  types  of  detergent  micelles  as  well  as  detergent/lipid  mixtures, 
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since  there  is  no  uncertainty  due  to  preferential  binding  of  one  of  the  hydrophobic 
components. 

3.5  Examples  of  Use  of  the  Density  Increment 

Butler  and  co-workers  have  used  this  approach  to  determine  the  average  molecular 
weights  of  membrane  proteins  dispersed  in  a  wide  variety  of  detergent  micelles.  The 
versatility  of  the  method  has  been  demonstrated  for  membrane  proteins  samples  hav¬ 
ing  effective  partial  specific  volumes  ranging  from  -0.14  to  0.734  mL  g_1  41,42  The 
density  increment  method  overcomes  many  of  the  difficulties  described  above  asso¬ 
ciated  with  using  the  class  of  micelles  containing  maltoside  head  groups,  as  several 
membrane  proteins  have  been  measured  in  either  DDM  or  decylmaltoside  (DM).43 
The  stoichiometry  of  the  bovine  major  intrinsic  protein  was  shown  to  be  tetrameric  in 
DM,  one  of  the  few  detergents  that  stabilized  the  complex,44  and  the  versatility  of  the 
density  increment  approach  is  demonstrated  by  recent  studies  aimed  at  determining 
the  oligomeric  state  of  the  EmrE  multidrug  transporter  solubilized  in  DDM  deter¬ 
gent.41  Two  forms  of  the  protein  were  prepared.  A  “high  molecular  weight”  form  that 
was  purified  using  minimal  detergent  concentrations  as  well  as  a  “lower  molecular 
weight”  form  purified  using  higher  concentrations  of  DDM.  Sedimentation  equilib¬ 
rium  demonstrated  that  both  forms  of  the  EmrE  were  in  fact  dimeric.  In  complemen¬ 
tary  work  that  determined  the  amount  of  DDM  bound  to  EmrE,  it  was  found  that  the 
“high  molecular  weight”  form  of  EmrE  was  not  a  different  oligomeric  state,  but  in 
fact  had  ~  10-fold  more  lipid  bound.  This  additional  lipid  increased  the  apparent  size 
as  measured  by  gel  filtration  and  sedimentation  velocity  and  obscured  true  molecular 
weight  analysis  using  those  methods. 

3.6  Thermodynamics  of  Membrane  Protein  Association 

The  origins  of  the  specificity  and  free  energy  of  interaction  between  membrane  pro¬ 
teins  are  not  well  understood.  Probably  the  largest  impact  that  sedimentation  equilib¬ 
rium  has  had  on  membrane  proteins  in  recent  years  has  been  due  to  its  use  in 
experimental  measurements  in  this  area  of  research.  While  it  had  been  widely  appre¬ 
ciated  that  the  properties  of  micelles  influenced  the  functional  and  oligomeric  states  of 
membrane  proteins,  this  knowledge  was  derived  from  empirical  observations  of  the 
effects  of  particular  detergents  on  specific  proteins.6  Using  sedimentation  equilibrium, 
Fleming  and  co-workers  have  shown  that  the  free  energy  of  association  between  mem¬ 
brane  proteins  can  be  measured  in  detergent  micelle  solutions  as  long  as  the  kinetics 
of  rearrangement  are  fast  relative  to  the  time  for  sedimentation  equilibrium.2’4’27 

The  experimental  advances  lead  to  several  theories  describing  the  concentration 
dependence  of  interactions  in  micelles.4’5’28’45  Under  the  assumption  that  a  self-associ¬ 
ation  reaction  for  an  integral  membrane  protein  will  take  place  within  the  micellar 
environment,  it  has  been  shown  in  two  different  detergents  that  the  apparent  equilib¬ 
rium  constant  for  a  monomer-dimer  reaction  varies  in  a  systematic  way  as  a  function 
of  detergent  concentration.4  This  apparent  equilibrium  constant  is  derived  from  the 
sedimentation  equilibrium  data  using  the  protein  concentration  in  bulk  molar  units.  As 
can  be  seen  in  Figure  4,  the  free  energy  of  association  will  become  less  favorable  if  the 
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in  [Micellar  C8E5]W  (Molar) 

Figure  4  The  apparent  free  energy  of  association  for  a  membrane  protein  monomer-dimer 
reaction  decreases  as  the  size  of  the  micelle  phase  increases.  The  apparent  free 
energy  of  association  is  plotted  as  a  function  of  the  logarithm  of  the  micellar 
detergent  concentration  expressed  on  the  aqueous  scale.  Under  the  ideal  dilute 
assumption,  the  data  should  vary  linearly  with  a  slope  of  RT.  The  linear  regres¬ 
sions  of  three  independent  experiments  an  average  slope  of  0.62  (RT—0.59  at 
25  °C)  Experiments  in  CgEs  micelles  were  carried  out  on  the  glycophorin  A  trans¬ 
membrane  helix  expressed  as  a  fusion  protein  with  Staphylococcal  nuclease. 
Reproduced  with  permission  from  Fleming,  20024 

size  of  the  micellar  phase  increases  because  the  membrane  protein  population  experi¬ 
ences  an  increase  in  volume  of  the  reaction  phase.  The  large  number  of  data  points 
available  in  a  sedimentation  equilibrium  experiment  coupled  with  modem  computa¬ 
tional  power  allowing  global  fits  of  different  initial  starting  conditions  are  two  advan¬ 
tages  of  using  the  analytical  ultracentrifuge  for  these  studies  over  alternative  methods. 

The  availability  of  protocols  to  quantify  the  free  energy  of  association  between 
integral  membrane  proteins  in  detergent  solutions  has  catalyzed  a  new  area  of 
research  in  the  membrane  protein  field.  Several  mutagenesis  studies  on  model  sys¬ 
tems  as  well  as  natural  membrane  proteins  have  begun  to  address  the  physical  princi¬ 
ples  underlying  oligomeric  stability  of  membrane  proteins.2’25,26’36  These  principles 
are  being  tested  by  membrane  protein  design  efforts  whose  success  are  evaluated  in 
large  part  by  sedimentation  equilibrium  analysis  of  the  designed  sequences.29-33,46 

3.7  Determination  of  the  Amount  of  Detergent  Bound  in  a 
Membrane  Protein  Complex 

In  the  characterization  of  a  membrane  protein  complex,  the  amount  of  detergent 
bound  to  a  membrane  protein  may  be  of  interest  for  a  particular  scientific  question. 
Implicit  in  many  of  the  discussions  above  is  the  fact  that  sedimentation  equilibrium 
experiments  carried  out  at  densities  away  from  the  detergent  match  point  contain  a 
measurable  contribution  from  the  bound  detergent.  Once  the  molecular  weight  of  the 
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protein  moiety  has  been  determined,  Equation  (3)  can  be  used  to  calculate  the  quan¬ 
tity  nDet  as  long  as  the  contribution  from  hydration  is  small.13  More  recently,  modem 
computers  have  been  used  to  globally  fit  the  primary  sedimentation  equilibrium  data 
collected  at  different  D20  concentrations  directly  for  this  parameter.47 

4  Sedimentation  Velocity  Analysis 

While  most  of  the  recent  advances  in  experimental  methods  for  analysis  of  mem¬ 
brane  proteins  have  come  in  the  area  of  sedimentation  equilibrium,  sedimentation 
velocity  still  remains  an  extremely  useful  tool.  Since  finding  optimal  conditions  for 
solubilization  of  a  membrane  protein  can  be  an  arduous  task,  sedimentation  velocity 
continues  to  be  a  fairly  quick  experimental  approach  for  evaluating  the  homogene¬ 
ity  of  a  sample.11,48,49  A  good  example  of  this  approach  is  shown  in  Figure  5,  where 
Musatov  et  al.  used  the  van  Holde-Weischet  analysis  method  to  evaluate  the  sedi¬ 
mentation  coefficient  distribution  for  cytochrome  bo3  as  a  function  of  the  concen¬ 
tration  of  Triton  X-100.  The  sedimentation  coefficient  distribution  showed  that  the 
protein  aggregated  when  the  detergent  concentration  was  low  (open  squares).  In  con¬ 
trast,  a  homogeneous  sedimentation  coefficient  distribution  was  found  at  deter¬ 
gent/protein  ratios  >2:1  (w/w).  Subsequent  sedimentation  equilibrium  analysis 
demonstrated  that  the  experimental  conditions  producing  the  homogeneous  popula¬ 
tion  represented  a  monomeric  form  of  the  cytochrome  bo 3  membrane  protein. 

A  similar  question  of  homogeneity  was  of  interest  in  a  study  by  MacPhee  et  al., 
who  used  sedimentation  velocity  to  characterize  the  interactions  of  lipoprotein  lipase 


Figure  5  An  example  of  sedimentation  velocity  analysis  to  determine  homogeneity  in  a 
membrane  protein  preparation.  Sedimentation  velocity  data  collected  at  27  000 
rpm  were  analyzed  using  the  van  Holde-Weischet  method.  Each  line  of  symbols 
represents  the  distribution  of  sedimentation  coefficients  across  the  boundary. 
Cytochrome  bo3  (0.5  mg  protein  mL~!)  was  solubilized  in  50  mM  potassium  phos¬ 
phate  (pH  8.1)  containing  (a)  1.0  mg  mL~ 1  Triton  X-100  (open  squares),  (b)  5  mg 
mL~‘  Triton  X-100  (filled  circles)  or  (c)  10  mg  mL Triton  X-100  (open  circles ). 
The  near  identity  of  sedimentation  coefficients  across  the  boundary  as  in  samples 
b  and  c  indicate  sample  homogeneity.  Subsequent  sedimentation  equilibrium 
analysis  demonstrated  a  monomeric  population  for  cytochrome  bo3  under  condi¬ 
tions  where  the  sedimentation  coefficient  indicated  homogeneity.  Reproduced  with 
permission  from  Musatofet  al.49 
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with  homogeneous  lipid  emulsions.15  An  interesting  aspect  of  this  work  is  that  the 
density  of  the  lipid  emulsions  is  less  than  that  of  water.  Therefore,  the  sedimentation 
velocity  experiment  was  used  to  evaluate  the  flotation  gradients  formed  upon  appli¬ 
cation  of  the  gravitational  potential.  This  resulted  in  the  quantification  of  complexes 
in  a  specialized  version  of  the  time  derivative  software50  by  defining  a  flotation  coef¬ 
ficient,  which  corresponds  to  the  flotation  equivalent  of  a  sedimentation  coefficient. 
Flotation  equilibrium  studies  were  also  carried  out  in  this  study. 

5  The  Future  for  Membrane  Proteins  and  the 
Analytical  Ultracentrifuge 

As  the  development  of  expression  systems  to  generate  purified  membrane  proteins 
continues,  an  increasing  number  of  membrane  protein  samples  will  be  available  for 
molecular  weight  and  thermodynamic  analysis  using  sedimentation  equilibrium.  The 
increase  in  availability  of  modem  ultracentrifuges  will  make  the  method  more  acces¬ 
sible,  and  sedimentation  velocity  will  continue  to  find  utility  as  a  method  for  screen¬ 
ing  membrane  protein  preparations  for  homogeneity.  The  exponential  increase  in 
high-resolution  structural  studies  on  membrane  proteins,51  most  of  which  are  carried 
out  in  detergent  micelles,  will  stimulate  solution  studies  on  this  class  of  proteins  in 
order  to  test  functional  questions  arising  from  the  structures.  The  ultracentrifuge  will 
continue  to  be  a  biophysical  tool  of  choice  for  answering  many  of  them. 
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